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Abstract 

The mathematical problem of the static storage optimisation is formulated and solved by 
means of a variational analysis. The solution obtained in implicit form is shedding light on the 
most important features of the optimal exercise strategy. We show how the solution depends 
on different constraint types including carry cost and cycling constraint. We investigate the 
relation between intrinsic and stochastic solutions. In particular we give another proof that 
the stochastic problem has a "bang-bang" optimal exercise strategy. We also show why the 
optimal stochastic exercise decision is always close to the intrinsic one. 

In the second half we develop a perturbation analysis to solve the stochastic optimisation 
problem. The obtained approximate solution allows us to estimate the time value of the 
storage option. In particular we find an answer to rather academic question of asymptotic 
time value for the mean reversion parameter approaching zero or infinity. We also investigate 
the differences between swing and storage problems. The analytical results are compared with 
numerical valuations and found to be in a good agreement. 

1 Introduction 

The problem of storage optimisation is driven by the necessity of the storage owners to optimise 
their expenses and maximise a potential profit which can be gained by operating the storage. There 
are plenty of real world storage examples: Gas storage, Oil storage, Hydro power plant, Coal stock, 
etc. By following some clever strategy - buying the underlying commodity cheep, storing it, and 
selling as the prices go up - the storage owner can make a profit. Thus, there is a need for the 
exercise strategy optimisation - such an exercise rule, which allows to maximise the profit and 
minimise the risks. 

One of the most popular ways of the numerical storage optimisation is the "dynamic programming" 
algorithm. It allows to treat both - intrinsic (when the prices are supposed to be frozen) as well 
as stochastic (when the prices are supposed to follow some stochastic process) problems. Usually 
the numerical solution provides the answers to the two most important questions: what is the 
optimal exercise decision now given the current state (prices and volume level of the storage), and 
what is the expected profit, provided we follow the optimal exercise strategy. The next level of 
sophistication is to provide the hedge strategy - a portfolio of derivatives (futures, options or any 
other financial instruments) which would minimise the financial risks. Of course, financial risk is 
only a feature of the stochastic problem, as the solution of the intrinsic problem is deterministic. 

In this paper we to develop an analytical approach to the storage optimisation problem. In the 
Sec. [3] we consider an intrinsic problem which deals with predefined deterministic price curve. 
Section 13.71 considers different special cases of constraints. Sec. [4] is devoted to the stochastic 
problem, where we make use of perturbation theory to solve the stochastic problem, and derive an 
estimate of the stochastic time value. In the sections [4 . 1 01 and 14 . 1 1 1 we make an example calculation 
of the time value of a simple storage and swing options and compare the results with numerical 
evaluation. We discuss the results in Sec [5] 
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2 Problem formulation 



The storage problem can be formulated as follows. The storage option holder is given a right to 
store some amount of underlying (let it be for simplicity gas) in a (virtual) storage facility. At 
every time moment the option holder may "do nothing" , inject or release the gas from the storage. 
Every time the gas is injected into the storage it must be bought on the market. Likewise every 
time the gas is released from the storage, it is sold on the market. Since the market price of gas 
changes with time, this may lead to a non-trivial cash-flow. The injection and release process 
must satisfy some operational constraints (for instance maximum injection/release rates, storage 
capacity, etc.), specified as boundary conditions. Every exercise profile (trajectory in the time- 
volume space) yields a different profit. In this sense the profit becomes a functional on the exercise 
trajectory. The aim of the storage option holder is to maximise the profit by choosing an optimal 
injection/release strategy. The problem can thus be formulated in terms of variational analysis - 
an optimal trajectory is the one delivering maximum of the profit functional. 

If the market prices are known in advance and never change, such a problem is deterministic. The 
possible profit is bound from above and from below, and thus there exists a trajectory such that no 
other trajectory yields higher profit. The maximal profit of the static problem is called "intrinsic 
value" . In the current section we investigate the deterministic problem by means of variational 
analysis. 

Let the storage time span be t £ [0,T e ]. Let F(t) be the market price of gas by the time t. The 
curve F(t) is called the forward price curve, since it is observed on the market prior the time 
t = and contains the information about the prices of gas with delivery in the future. By the 
definition of the static problem the forward curve never changes, and hence it does not depend on 
the observation time. 

Let q(t) be the amount of gas in the storage by the time t. The curve q(t) defines the exercise 
trajectory. The initial and terminal conditions are 

g(0) - Qstan ; (2.1) 

q(T e ) = Qend ■ (2.2) 



The cash flow (per unit time) resulting from trading the gas according to the exercise strategy q(t) 
is given by 



-q(t)F(t). (2.3) 

An additional cash flow results from taking into account the injection/release costs (operating 
costs). Let us designate "y(q(t)) the operating costs per unit time. The terminal profit is given by 
the cumulative cash flow. We thus introduce the target functional 

So[q(t)\ = - [ ' U(t) F(t) + 7 (g(i))l dt . (2.4) 

The value of this functional on the trajectory q(t) gives the storage value conditional on that 
trajectory. We have used here subscript to indicate the unmodified action integral. In the 
next section we will introduce a modified action integral S[q(t)], which includes additional terms 
intended to enforce the operational constraints. Running ahead we notice that on any trajectory 
q(t) allowed by constraints the values of the modified and unmodified functionals coincide. 

Using "physical" terminology we introduce the (unmodified) Lagrangian L corresponding to the 
target functional So a s 

L = -q(t) F(t) - 7 (q(t)) . (2.5) 



As mentioned above, the target functional is bound from above and from below, and hence there 
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must exist such trajectories on which the functional achieves its maximum and minimu nfl. We 
make use of variational analysis to search the extremal trajectory, i.e. the trajectory on which the 
first variation of the target functional vanishes. Once the extremal trajectory is found, one has 
to make sure that it delivers the maximum of the functional. It could be done by evaluating the 
second variation and checking its sign. 

The operational constraints may differ for different storage option types. Below we consider some 
typical constraints, which can be classified as local. Local property implies that a constraint at 
time t can be expressed in terms of state variables and their derivatives q(t), q(t), F(t), F(t), ... at 
time t. 

We consider the following two operational constraints: 

1. The volume q(t) is allowed to be within the interval 

q(t) G [Qmin(t),Qmax(M, (2-6) 
where the boundaries Q m in(t),Q ma x(t) are time dependent. 

2. The injection/release rate q{t) is bounded to 

Train ? ^max\ ■ 

(2.7) 

Generally the maximal injection/release rates may depend on the time and volume: r = 
r(t,q(t)). Below we only consider the constant injection/release rates. The special case of 
volume dependent rates will be considered in the Sec. 13.7.41 

One of the most common examples of nonlocal constraints is the so called cycle constraint. It can 
be formulated as follows. One introduces an intake cycle variable as 

c(T) = jf q(t)6(q(t))dt; where 8{x) = j ^ (2.8) 

which has a meaning of total injected volume by the time T . The cycle constraint requires that 
the terminal value c(T e ) does not exceed certain threshold 

c(-^e) — Cmax ■ (2-9) 

Similarly one introduces a release cycle constraint. 

Most of the scope of this paper is not dealing with non-local constraints. However we will return 
to the cycle constraint below in the section 13.7.31 



3 Solution of deterministic problem 
3.1 Penalty functions 

To restrict q(t) from going beyond the range [Q m im Qmax], we can introduce a parametrised penalty 
function — <j)[q(t), N^] and add it to the unmodified Lagrangian (12.5[) 

L = L - <f)(q) = -qz - j(q) - <p{q). (3.1) 

do not conduct a thorough analysis of existence and uniqueness of the solution of the variational problem. 
However we point out that the class of functions q(t) on which the target functional is defined and could reach 
maximum should be rather broad. In particular it must include all continuous locally integrable functions. Other 
functions under the functional integral are allowed to be discontinuous. We will also use the concept of convergence 
of functions, which we will always understand as a weak convergence, i.e. i/>k — > ip if S^fc] — > S[ij>]. 
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The penalty function is any smooth function, which is in the limit — > oo approaches zero within 
the interval (QmimQmax) and positive infinity otherwise. A particular example of this function 
could be 



a(q(t) - b) 



2N S 



with a = 



Q, 



b = 



Qmax + Q 



A similar penalty function — ip[q(t), N^] can be introduced to restrict q(t) from going beyond the 
interval [r m i n , r max \. In the limit N$ — > oo it approaches zero within (r TO j n , r max ) and plus infinity 
otherwise. The modified Lagrangian becomes 



L = L - <f>(q) - ip(q) 



qF + cj){q) + T/j(q) + 7 (j) 



It corresponds to the following modified action integral: 



S 



qF + 4>{q) + + 7(9) 



dt. 



(3.2) 



(3.3) 



3.2 Euler-Lagrange equation 

As we know from the variational analysis, the extremal trajectory of the integral J L{q, q) dt satisfies 
the Euler-Lagrange equation 



dL{q,q) = d dL(q,q) 
dq dt dq 



Thus, for the modified Lagrangian we obtain the following equation 



4>'(q) = j t [F + ^(q)+l'(q) 



(3.4) 



(3.5) 



which needs to be solved in the limit — ¥ 00, — > 00. 

There are two types of solution of the latter equation. One is obtained on the interval where the 
trajectory remains strictly within the boundaries 

q(t) G (0 rain i ^cmax ) ■ 

The second solution type is, when the trajectory lies on the boundary 

q(t) = Qmin(t), Or q(t) = Qrnax(t) ■ 

The general solution consists of pieces of the solutions of the types one and two. Let us consider 
each of these types separately. 



3.3 Solution within the boundaries 

First we consider the solution in the interval, where the trajectory does not touch the boundary. 
In the limit N<j> — > 00 the l.h.s. of Eq. Q3.5P vanishes. We thus have 

F(t) + TP'(q)+ 7 '(q) = C, (3.6) 

where C is constant for the whole period of time, where the solution does not touch the boundary. 

Let us consider one particular example of the operating cost function. It is rather typical for a 
gas storage facility that the operating cost is proportional to the amount of the gas released or 
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injected into the storage. Again we parametrise it with iV 7 so that for finite iV 7 the function 7(g) 
is smooth, and in the limit iV 7 — > 00 it is piecewise linear 

7(4) = I lmi f l4> V (3-7) 
' ' \ ~lreiq, for q < 0; v ' 

where 7j„j > and j re i > 0. 

The solution of Eq. (|3.6p is now straightforward to find graphically (see Fig. [1}. In case r m i n < 
0, r ma2; > we obtain: 

I 'min (*) 7 -FW > C + 7reZ ; 

?(t) = < , C - j inj < F{t) <C + lrel ; (3.8) 

I ' max 

(t) , F(t) < C linj ! 

We see that the value C can be interpreted as a trigger price. If the price F(t) is within the interval 
[C — "finj, C + lrel], the extremal trajectory is constant: q — 0. If the price is below this interval, 
the volume q(t) is increasing at maximal rate: q(t) — r max (t). If the price is above the interval, 
the volume is decreasing at maximum rate: q(t) = r m i n (t). 



^(q)+l'(q) > 

linj 


\ 


C - F(t) 


r 


min 


T-max n 
~lrel y 



Figure 1: Graphical illustration of Eq. (13.6[) . 



3.4 Solution on the boundary 

We consider only a boundary Q m in(t) and Q m ax(t), that moves slower than the limit rate, i.e. 
Tmin < Omm < r max and r min < Q max < r max . If the boundary moves faster, then no trajectory 
can lie on it. 

On the boundary the function ip'(q) vanishes, and the function <j)' '(q) takes a finite value (positive 
if q = Qmax or negative if q = Q m in)- Eq. (|3.5[) on the boundary becomes 

<t>'(q) = f t [F(t)+ 1 '(q)]; (3.9) 

There is another view on the solution on the boundary. We may consider our problem as a general 
variational problem with spacial boundary condition. A path lying on the boundary can not be 
varied and hence be a part of extremal. But a solution, that maximises the functional, can consist of 
parts lying on the boundary and extremals - pieces of trajectory satisfying the extremal condition 
(i.e., vanishing first variation). 
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3.5 Approaching the boundary 

An important statement can be made about the connection point between extremal and the bound- 
ary. From the variational analysis it is known that an extremal touches the boundary smoothly. 

The extremal solution of our variational problem for large but finite iV^,.^,. 7 is a smooth function 
and must satisfy the condition of smooth connection between extremal and boundary. Let us 
consider a simplified problem (a generalisation is straightforward), where we suppose a constant 
boundary condition, and zero operating cost: 

Qrnax = Const, Q m in = COHSt, "/(q) = . 

In this case the extremal trajectory should touch the boundary at zero slope: 
q(t*) = 0, 

where t* is the connection time. From Eq. Q3.8p in the limit 7 re / - ► and ji n j — > we conclude 
that 

F(t*) = C (3.10) 
Taking the operating cost again into consideration leads to the following cases: 

1. The trajectory touches the lower boundary from the left: 

F(i*) = C + 7rei ; F(t*)<0. (3.11) 

2. The trajectory touches the upper boundary from the left: 

F{t*) = C- linj ; F(t*)>Q. (3.12) 

3. The trajectory touches the lower boundary from the right: 

F(t*) = C - j inj ; F(t*)<0. (3.13) 

4. The trajectory touches the upper boundary from the right: 

F(t*) = C + 7 ,. e/ ; F(t*)>0. (3.14) 

These conditions must be satisfied for the time interval t € (0, T e ). The extremal trajectory on the 
times t = and t ~T e does not have to satisfy these conditions. 

The conditions (j3.lH3.f4")) can be derived from a rule of thumb: if at the moment t* of boundary 
touch the boundary was virtually not there, the trajectory had to switch the mode anyway (like 
from "full release" to "do nothing" in case (|3.11l) . et.c). The requirement to the price curve 
derivative follows from the simple observation: when the trajectory approaches the boundary, the 
forward curve should enter the dead zone, and when the trajectory leaves the boundary, the forward 
curve should exit the dead zone. 

In the limit iV^.^-y — > oo the solution may be non smooth any more. In particular the connection 
point q(t*) will be a point of the slope discontinuity. 

It is worth emphasising that the constant C may be different for different extremal pieces of 
trajectory separated by the boundary touch. 
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3.6 Conclusion 



Eq. (|3.8p together with boundary conditions (j3.lH3.Lljl and condition on the start and end volume 

5(0) = Qstart] q{T e ) = Q end = Qstart + <j dt (3.15) 

Jo 

provides an implicit solution for the problem. We summarise some properties of the solution: 

1. The optimal exercise strategy is always bang-bang. If expressed in "industrial" language, the 
compressor either stands or works with a full power. 

2. For each piece of trajectory, separated from others with a boundary touch, there is a trigger 
price C and a "dead zone" [C — 7m j, G + 7rei]> The volume in the storage is increasing 
at maximum rate if the price is below the zone, decreasing at maximum rate if the price is 
above the zone, and is constant if the price is in the zone. The width of the zone is defined 
by the operating costs. 

3. If the trajectory touches the boundary in the time interval t € (0,T e ), it must satisfy one of 
the boundary conditions (13.lH3.14l) . 

4. Generally a trajectory satisfying all the previous conditions is not unique. Any solution must 
obey all these conditions, but not any function obeying all conditions is a solution. 

The solution found in the previous section maximises the modified target functional S[q(t)]. Let 
q(t) be the obtained solution. Now the optimal path q(t) can be substituted to the unmodified 
functional So[q(t)] to give the value of option. A natural question is whether the trajectory q(t), 
which is optimal for modified functional, is still optimal for the unmodified one. 

The optimal trajectory q belongs to the broad class of all locally integrable functions C%. Let 
us introduce a class of trajectories JC C C\ satisfying all the constraints. By definition penalty 
functions vanish on JC: 

ct>(q) = 0; #0 = 0, iff qeJC. 

Obviously the values of modified and unmodified functionals coincide on JC: 

S[q]=S [q], VqEJC. 

Next we show that the optimal trajectory q belongs to the class JC and hence satisfies the con- 
straints. Indeed, generally on some parts of the maximal trajectory ip'{q) 7^ and 4>'(q) 0. 
According to Eas. (l3.6p and (|3.9[) they take finite values. One can show (we leave it without proof) 
that in the limit — > oo everywhere where <f>' (or ip') take finite value, the function 4> (or ip) 
vanishes. We conclude that on the extremal trajectory 

<t>{q) =0; i>(q) = 0. (3.16) 

and hence q € JC. Form this it follows that the values of modified and unmodified target functionals 
coincide on the maximal trajectory: 

S[q] = S [q] . 

Last step is to prove that the trajectory q delivers the maximum of the unmodified functional. It 
is obvious from the following consideration. The trajectory q maximises the modified functional 
on the space C\ and hence it will also maximise this functional on the smaller space JC C C\. Since 
the values of both functionals coincide on JC, the trajectory q will also maximise the unmodified 
functional So on the space of all trajectories satisfying constraints. 

From this point we drop the subscript "0" for the action integral, unless we want to emphasise 
the difference between modified and unmodified target functions. We also drop the "bar" for the 
optimal trajectory q(t) everywhere where it does not lead to ambiguity. 
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3.7 Other constraint types 



3.7.1 Carry cost 

Some storage contracts may include a "carry cost" , which is the cost of keeping the commodity in 
the storage. For instance an oil storage is subject to a heating cost (a fuel oil in a storage has to 
be kept warm at approximately 60°C). 

The carry cost is specified as a time-dependent price j c (t) per unit time per unit volume. Thus, 
the target function becomes 



So — — 



T, 



qF(t)+j(q)+i c {t)q(t) dt (3.17) 



Substituting the new Lagrangian into the Lagrange-Euler equation we obtain between boundaries: 
F(t)+^{q) + i(q) = C(t) (3.18) 

where 

dC(t) 



dt 



7c(t) ■ (3.19) 



Thus, the solution of the problem with the carry cost is absolutely the same as that without but the 
only difference: the trigger price is not a constant, rather a function of time satisfying Eq. (|3.19l) . 



3.7.2 Solution with free terminal condition 

Sometimes the storage problem can be defined with a free terminal condition. For instance, one is 
allowed to leave an arbitrary amount of underlying in the storage, and for the remaining volume 
one gets a additional pay-off equivalent to selling that volume at some effective price F e , usually 
referred to as a final unit price. We define the target functional S as 

S[q(t)} = - { ' [q(t) F(t) + j(q)} dt + q end F e . (3.20) 



The optimisation problem is now to find an optimal exercise trajectory q(t) , q en d — q(t en d) — q(T e ), 
satisfying operational constraints, that would maximise the target functions p.20p . It is easy to 
see that any extremal trajectory of (|3.20[) is also extremal of (|2.4I) . Indeed, if the trajectory q(t) is 
extremal of (|3.20l) . the variation of the target functional vanishes on any small variation Sq(t). Since 
there is no terminal boundary condition, the variation 5q(t) may not vanish at the end point. But 
if the target function does not change on the whole class of allowed trajectories q(t) = q(t) + 5q(t), 
it also does not change on the sub-class of trajectories q(t) = q(t) + Sq(t) with fixed terminal value 
(Sq(T e ) = 0). 

We conclude that to find a maximising trajectory for the functional (|3.20p , we need first to find a 
solution for the functional (|2.4|) with fixed terminal conditions. Then, considering this solution as 
a function of terminal state q en d, we need to find a maximum of (|3.20|) as a plain function of q en d- 

As will be shown in the Sec. 13.81 if the target functional (|2.4p is considered as a function of the 
terminal state q en d, then its derivative with respect to q en d is 

ds ° =-C. 



dq en d 

Thus, the target functional (|3.20l) . considered as a function of q en d, has a derivative 

^=F e ~C, (3.21) 

Cq e nd 
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where C is the trigger price of the last part of trajectory. We conclude that the condition for a 
target functional with free terminal state to have maximum is 

C = F e . (3.22) 

In practice the storage option with free terminal state has two possibilities. One possibility is that 
the terminal state lies on the boundary Q m in or Qmax- In this case the problem has a solution 
with fixed end. Another possibility is that the trajectory ends between boundaries Q m in < q{T e ) < 
Qmax- In this case the condition (|3.22p must be fulfilled. 

If the storage option has no final unit price (price for the remaining volume), it is most likely 
to terminate with the lowest possible volume. Indeed, as we already know, the derivative of the 
storage option value with respect to the terminal state equals — C. If we neglect the operating 
costs, the trigger price can not be smaller than the smallest gas price. Hence, if the gas price is 
positive, then the derivative of the option value with respect to the terminal level is negative, and 
thus the optimal trajectory must terminate at the lowest possible level. Consequently the storage 
option can be considered as a problem with fixed terminal state (although it may not be fixed 
according to the contract). 

An opposite example is a swing option. A typical swing option is a contract between a gas supplier 
and a trading company. The trader buys gas at some predefined price from the supplier and sells 
it on the market at the current market price. The spread between the market and supplier price 
becomes the effective gas price for the trader. The swing option allows the trader to take the gas 
from the supplier according to some flexible scheme (i.e. to decide, when to take and how much 
within certain constraints). Thus, the swing option can be formulated in terms of a storage option. 
However unlike the storage option, the effective price (the spread) in the case of swing option can 
be both positive and negative. Depending on the actual prices and on the contract constraints, 
it may happen that the terminal state does not lie on either lower or upper boundaries. In this 
case we deal with the problem with the free terminal condition. For this problem the trigger level 
equals zero 

C* = 0, (3.23) 
since there is no equivalent for the final unit price in the swing contract. 



3.7.3 Cycle constraint 

The cycle constraint (|2.9[) is formulated as a maximum allowed injected volume during the operation 
period t € [0, T e ]. Instead of considering it in the form of inequality we can proceed as follows. We 
find a solution in two steps. On the first step we solve the problem without the cycle constraint and 
calculate the cycle variable on the optimal trajectory. If the cycle variable is below the threshold 
Cmax, the obtained solution satisfies the cycle constraint. However if the solution violates the cycle 
constraint, we can reformulate the problem, imposing the "biting" cycle constraint 

c(T e ) = / " q(t) 9{q(t)) dt = c max (3.24) 



o 



This condition allows to formulate our problem in terms of conditional variational extremum. It 
is solved by means of Lagrange multiplier. Namely we introduce a modified Lagrangian 



L = — 



?F + 0fa) + V(«) + 7(<?) + Ag(t)0(g(t)) . (3-25) 



where A is an independent variable (Lagrange multiplier) . The solution of the latter problem will 
contain the undefined coefficient A, which has to be found from the additional equation (I3.24p . 

Within the boundaries the Lagrange-Euler equation for the constrained Lagrangian becomes (com- 
pare to Eq. ([3T6"])) 

C = F(t) + ip'{q) + i{q) + A (9{q) + q 5{q)) = F(t) + i//(q) + 7 '(<j) + A 6{q) . (3.26) 



9 



Solving this equation we find that the solution is similar to that without cycle constraint, but 
has the "dead zone" by A wider. Thus, the cycle constraint is equivalent to some additional 
operating costs. One can find a Active additional release and/or injection costs (same for the whole 
trajectory), such that if added to the storage without cycle constraint, the resulting trajectory 
would be extremal for the Lagrangian with imposed cycle constraint. It plays no role if the 
additional cost is for injection, release or both, since it is only the width of the dead zone which 
matters. The only exception from this rule is if the optimal trajectory does not inject or release 
enough volume. For instance if the trajectory never releases gas, the additional release cost will 
not effect the trajectory and will not help to respect the intake cycle constraint. Practically one 
should add injection costs to fulfil the intake cycle constraint, and release costs to fulfil the release 
cycle constraint. 

Generally the solution between boundary touches contains two free parameters - trigger price C 
and cycle Lagrange multiplier A. They can be equivalently expressed by upper and lower bounds 
of the dead zone. These parameters have to be found in such a way that the obtained solution 
satisfies boundary touch conditions (|3.1HI3.14"f , initial and terminal conditions (|3. 15|) and the cycle 
constraint (|3.24|) . 



3.7.4 Volume dependent injection/release rates 

If the maximal injection/release rates are volume dependent, the penalty function ip becomes an 
explicit function of volume: -0 = ip(q,q). In this case the Euler-Lagrange equation within the 
boundaries becomes 

8 q iP(q, q) = j t [F + d q i>(q, q) + 7' («)] ■ (3.27) 
The l.h.s. of this equation is an integrable implicit function of time. Designating 
dC(t) 



dt 



= d q ^q(T),q(T)); C(t)=C + 0, V(?(t),4(t)) dr 



we obtain the same solution as in the case of volume independent rates with the difference that 
C is not constant any longer, rather a function of time. We conclude that the optimal exercise 
strategy still preserves the "bang-bang" property. However is does not posses the constant trigger 
level. The function C(t) can be reverse engineered from the optimal trajectory q(t), provided the 
latter is been found. 

Let q(t) be the optimal trajectory. On this trajectory the derivative d q ip(q,q) takes a finite value, 
which can be found from the equation 

d q lP{q{t),q{t)) = C(t)-F(t)- 1 '(q(t)). 

We can also find the relation between the derivatives d q ip(q,q) and d q ip(q,q)- Indeed, if the 
trajectory q(t) lies for instance on the maximum injection rate boundary q = r max (q), then 
the relation between dq and dq is simply dq — r' max (q) dq. It's easy to show that the gradient 
{d q tp(q,q),d q ip(q,q)} must be orthogonal to the vector {dq,dq}. Hence 

d q ?P(q, q) = r' max (q) d q tp(q, q) . 

We thus obtain the differential equation 



iU .'(q(t))(c(t)-F(t)- 1 '(q(jt)j)- r'(q(t)) ^ 



q=q(t) 



Here r(q) = r max must be used for "injection" and r min for "release" parts of the trajectory. This 
equation allows to find the function C(t) provided the optimal trajectory q(t) is already known. 
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3.8 Dependency on initial and terminal conditions 



The optimal path q(t) and the target functional S^t)] are calculated for fixed terminal point 
{T e , Qend}- If this point is slightly shifted in either direction - time or volume - the optimal path 
becomes different, and the target function as well. In this sense the target function can be viewed 
as a function of terminal point S = S(q, £). In this section we are interested in the properties of 
this function. 



From the standard variational analysis we have: 

dS_ _ dL 
dq dq ' 



9S__ L _ .dL 
dt ^ dq 



(3.28) 



We apply these equations to the perturbed action S[q(t)]. Making use of Eq. (13.161) we will find 
the derivatives of the unperturbed action So [<?(£)]. 



3.8.1 Volume derivative 



Using the definition 



of the perturbed Lagrangian we find 



dS_ 

Oq 



The spacial derivative makes sense only within the boundaries, 
replacing S with Sq we finally find 



dS 

dqend 



OS 



-c, 



lend 



Making use of Eq. (|3.6[) and 



(3.29) 



where C is the trigger level of the last part of the trajectory. If C is positive, one can draw a 
conclusion, that the smaller the terminal state q(T e ), the bigger is the target functional So(T e ). 

Derivative with respect to the initial condition q s t a rt is given by inverting the sign: 



dSn 



d,s 



dq s 



dq s 



= C, 



(3.30) 



where C is the trigger level of the initial part of the trajectory. 

From this an important consequence follows, which we already mentioned in the section 13.7.21 If 
there is no final unit price and if the prices F(t) are strictly positive, so is the trigger price C. As 
a result the option value has a negative derivative with respect to the terminal level. Hence the 
optimal trajectory will always take at the end the smallest possible value allowed by constraints. By 
other words, we know that the right end of the optimal trajectory will lie on the lower boundary. 
Thus, even if the contract allows the storage to end with some remaining volume inside (free 
terminal condition), the optimisation problem still can be considered as the one with fixed terminal 
condition. 

This is not any longer the case in presence a positive final unit price is given or if the prices F(t) 
can be negative. With the final unit price F e the option value derivative is given by Eq. (|3.2ip and 
can be positive, negative or zero. Also extremely high injection/release costs may lead to vanishing 
trigger price and the option value derivative. 

3.8.2 Time derivative 



From definition we find 
dS 



dt 



= q 



-<j>(q) -4>{q) -7(3) 
- 7(9) • 



(3.31) 
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Within the boundaries we make use of Eq. (13 .€>[) . Replacing S with So one obtains 

= = <K*e«<i) C - FfW) - 7(9) , (3.32) 

where q(t enc i) is the solution (13.81) obtained for fixed terminal condition. In particular one can see 
that dSo/dt en d > (provided r min < 0,r rnax > 0). 

On the boundary in the special case Q max — Q m in = one obtains q — 0, j(q) — and 
dS 



= 0. 



(3.33) 



It is worth noting that the derivatives with respect to terminal (initial) boundary conditions is 
universal: it is independent on whether the optimal trajectory touches the boundary or not. To 
calculate the derivatives, one just needs to know the solution q(t) and the trigger price C at the 
end (beginning) of the trajectory. This also remains valid with imposed cycle constraint. 



4 Stochastic problem 

The following section is devoted to the stochastic optimisation problem. As a starting point of 
the stochastic approach we take the solution of the corresponding intrinsic problem, which is 
supposed to be known. Under some very generic assumptions about the price process we derive 
a stochastic differential equation which governs the evolution of the storage value in the rolling 
intrinsic approximation. This allows us to find the expectation value of the terminal option value, 
and hence to estimate the time value of the storage option. 

In real world the prices are not static, rather evolve with time in a stochastic way. Every time 
we observe a new forward curve on the market, we can solve a new intrinsic problem, obtaining a 
new solution. The forward price F, as well as the optimal intrinsic strategy q and trigger price C 
become not only functions of "delivery" (or "maturity") time T, but also functions of the "current" 
(or "observation") time t: 

C = C{t); F — F(t, T) ; £=£(i,T). 

By other words, F(t, T) is the forward price with delivery time T observed on the market at time t. 
C(t) and q(t,T) are trigger price and optimal intrinsic strategy, calculated at time t according to 
current observed forward price curve F(t,T). Throughout the following section we designate 

r(t,T) = q(t,T) 

the optimal intrinsic exercise strategy on the observation time t. 

By definition of the stochastic problem, the initial forward curve F(0, T) is known. For any future 
observation time t > there is no deterministic forward curve. The evolution of the forward 
curve is supposed to satisfy some stochastic differential equation. It means that any possible curve 
F(t, T) can be assigned a certain probability amplitude, and the solution of the problem has to be 
formulated in the probabilistic language. 



Assumptions: To make further analysis we make some simplifying assumptions. 

• First we neglect the injection/release costs by setting 7 = 0. 

• We also neglect the "surface effects" - the possible influence of the boundary touch on the 
optimal trajectory and on the storage option value. Under the specified assumptions for 
every observation time t there exists a single trigger price C(t) for the whole trajectory, and 
the optimal intrinsic strategy can be presented in the form 

r(t, T) - r rnm + Ar 9{C{t) ~ F(t, T)) , (4.1) 
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where Ar = r max — r m i n , 9(x) is the Heaviside function and C(t) is the intrinsic trigger price 
at time t. We also introduce the following notation: 



r« (t, T) = Ar S(C(t) - F{t, T)) ; (t, T) = Ar <5'(C(t) - F(t, T)) 



(2), 



(4.2) 



Here <$(x) = 9'{x) is the Dirac delta function. The derivative is taken with respect to the 
argument of the (5-function. Notice the dimensionality: 



~Q~ 

t 



A 1 ) - 



Q 



,(2) _ 



Q 



(4.3) 



By neglecting the "surface effects" we restrict our consideration to the so called non-flexible 
storages. The storage flexibility can be characterised by a number of times N c the storage can 
be filled and emptied completely during the storage life time (maximum number of cycles). 
The storage is usually called non- flexible if iV c < 1. For N c 3> 1 the storage is referred to as 
flexible. Obviously, the surface effects have much bigger impact on the flexible storages. 

• As an additional assumption we believe that the prices F(t, T) are always strictly positive. 
This allows us to consider the optimisation problem as a problem with fixed initial and 
terminal conditions (see Sec. 13.8. This condition is applicable for the storage option and is 
not for the swing option. The swing option will be considered separately in the section |4~TT1 



4.1 Formulation of the stochastic optimisation problem 

For the intrinsic problem one could determine an optimal exercise strategy r(T) which has a 
meaning of amount (per unit time) of underlying which has to be injected/released from the 
storage and bought/sold on the market at the time T. It makes no difference if this amount is sold 
on the spot (i.e. exactly on the time T) or on the forward market prior the delivery time T, since 
the price of the underlying does not change. 

This is not the case in the stochastic reality. Since the price for the delivery on time T changes with 
the observation time t, it is essential to specify when exactly the underlying is traded. We thus 
distinguish between spot trades and forward trades. A spot trade is associated with immediate 
delivery, and the corresponding price is referred to as "spot price" s(t) = F(t,T — t). A forward 
trade has a delivery in the future, and generally can be any kind of financial products, including 
forward contracts or options. The forward trades usually have a purpose of financial hedging, so 
we will refer to them as hedge trades. For simplicity we only consider linear hedge products, i.e. 
forward contracts. Thus for the stochastic problem at every time t we can introduce a trade profile 
h(t,T), which has the following meaning: 

• The up-front value h(t, t) is the exercise trade. The amount of underlying h(t, t) dt has to be 
injected in the storage within the time dt, and the corresponding amount has to be purchased 
on the market. The purchased amount is a net amount purchased on the forward and spot 
markets for the delivery on T = t. We will refer to the exercise trade h(t,t) as prompt 
exercise. 

• The volume h(t, T) for T > t is a hedge position. The value h(t, T) dT has a meaning of the 
total volume purchased on the forward market (i.e. aggregated volume of all forward trades 
performed during the time period [0, i\) for the delivery period [T, T + dT]. 

For the stochastic optimisation problem there exists no optimal exercise trajectory, which could be 
computed prior or during the contract time. Hence it is not possible do define a target functional 
in the form of integral (|2.4p . Instead we have to formulate the aim of the stochastic optimisation 
problem as finding an optimal trading strategy h(t, T), such that the expectation value of the total 
profit is maximised. 
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The trading profile can be used for exercise and hedge trades on the future market, giving rise to 
the cash flow at the observation timo Since the hedge profile is different for every time moment t, 
one has to update the hedge position continuously by buying and selling a proper amount of the 
underlying on the future and spot markets. Let h(t,T) be the hedge position taken by the time t. 
This also includes the exercise volume h(t, t) dt, which has to be delivered during the time dt. 
After the infinitesimal time increment dt we observe the new prices on the market, and find a new 
optimal hedge strategy h(t + dt,T). The volume 

Sh(t, T) = h(t + dt, T) - h(t, T) 

has to be purchased on the market in order to update the hedge position. This volume includes 
the spot trade 5h(t, t) dt and hedge trades 5h(t, T) dT for T > t. Of the whole hedge profile h(t, T) 
the only initial value 

h(t,T)\ T=t . (4.4) 

is associated with the "physical" delivery of the underlying, and the rest is covered by the forward 
contracts. 

Our aim now is to find for every observation time t an optimal hedge profile h(t, T) conditional on 
the market state F(t, T) and current initial condition q(t). In order to give a precise mathematical 
definition of the stochastic optimisation problem we need to define the stochastic profit function. 

Let h(0, T) be the initial hedge profile, calculated on the basis of the initial conditions F(0, T) and 
q(0) = Q start- The integral 

P = - [ h(Q, T) F(0, T) dt . (4.5) 
Jo 

gives the total cost of setting up the initial hedge profile. Let h(t, T) be the hedge profile at time 
t. We introduce the profit function P(t), which is defined as a cumulative cash flow from all hedge 
trades. The profit function satisfies the following equations: 

P(0) = P = - [ h(0,T)F(0,T)dT (4.6) 
Jo 

dP(t) = -J Sh(t,T) (F(t,T) + SF(t,T)^j dT , (4.7) 

where Sh and SF are the increments of the optimal hedge strategy and forward curve within a 
time-interval dt. The stochastic optimisation problem can now be formulated as finding at every 
time t an optimal hedge profile h(t, T) such that the expectation value of the terminal profit P(T e ) 
is maximised. 

The defined profit function does not represent the total portfolio value, since it only reflects the 
cash-flow and does not take into account the value of the storage option itself. However this is the 
only part of the portfolio, which can be influenced by our trading strategy, and hence optimisation 
of the profit function is equivalent to the optimisation of the Profit and Loss. 

Note that the hedge profile h(t, T) for T > t does not need to satisfy the storage constraints, since 
it is not associated with the physical delivery. The only physical part of the hedge profile is the 
prompt exercise trade h(t,t) which has to obey the storage constraints. 

Notice that the integral (I4.5[) coincides with the target functional (|2.4p (we have neglected the 
operating costs). As an example of a (non-optimal) trading strategy we can do the following. Let 
the initial hedge profile coincides with the initial intrinsic strategy 

h(0,T) = r(0,T) 



2 In reality the exact timing of the cash flow depends on the type of financial instruments used for hedging, and 
can be spread between observation and delivery time. However it plays no role for our analysis, and we agree to 
associate the cash-flow from the forward deals with the trading (observation) time t. Thus we agree that if the 
volume V is purchased on the observation time t with delivery on T, then the corresponding cash-flow — V F(t, T) 
is associated with the observation time t. 
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In this case the initial value of the profit function coincides with the intrinsic value P(0) = Pq = Sq. 
If we decided not to change the hedge profile with the tim^l 

h(t,T) = h(0,T) 

then the profit function would remain constant, giving rise to the terminal profit equal to the initial 
intrinsic value. Thus the stochastic profit is at least as big as the intrinsic value. In reality for 
every time t one can almost always find a new hedge profile h(t, T) such that the profit function 
increment SP is positive. The increase of the terminal profit function in comparison with the 
intrinsic value is referred to as time value. 

Running ahead it is also interesting to notice that only the prompt exercise decision h(t, t) is 
relevant for the expectation value of the profit. The rest of the hedge profile h(t, T) for T > t only 
effects the distribution of the terminal profit as a stochastic value. If our model of the stochastic 
price process was exact, and if we neglected the market frictions, then it would be theoretically 
possible to find such a hedge profile, which would make the distribution of the terminal profit 
function arbitrarily narrow. 



4.2 Rolling intrinsic strategy 

The optimal intrinsic solution at time t is a function q(T) — r(t,T), which maximises the deter- 
ministic functional 

r(t,T)F{t,T)dT (4.8) 

given the initial condition q(t) and the market state F(t, T) at the time t, and subject to the 
storage constraints. 

If at every time moment t the hedge strategy h(t,T) coincides with the current optimal intrinsic 
exercise 

h(t,T) = r(t,T) , 

then the strategy is called rolling intrinsic. 

In the Sec. 14.51 we show that under very general assumptions the rolling intrinsic is very good 
approximation of the optimal stochastic exercise strategy. Based on this fact and taking into 
account that the hedge strategy does not effect the expected value of the profit function, one can 
conclude that if we follow the rolling intrinsic strategy, the expected value of the terminal profit 
must be equal the true option value. This also allows one to calculate the time value as an average 
cumulative cash flow from all hedge and exercise trades. In App. [B] we give a systematic proof of 
this fact. 



4.3 Forward curve evolution model 

Let F(t, T) for each time T be a zero mean stochastic process governed by the stochastic differential 
equation in Ito representation 

^2 =<iM«(T), (4.9) 

where M t (T) is a martingale process with respect to the variable t parametrised by the variable T. 
In general for each future time T the martingale processes M t (T) are different processes, which 
might or might not be correlated. 



3 We can do so because the intrinsic profile satisfies the storage constraints, and hence we will be able to fulfil 
the contractual obligations. 
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The correlation between processes M(T) is given by 

( dM t (Ti ) cLM t (T 2 ) ) = a 2 (t,T u T 2 )dt; ( dM tl (Ti ) dM t2 (T 2 ) > = for h^t 2 , 

where the function a(t, Ti, T 2 ) is supposed to be a decaying function of \T\— 1\, |T 2 — t|, and |T 2 — T\\. 
In the stochastic calculus the product dM tl (Ti) dM t2 (T 2 ) can be considered as deterministic and 
requires no averaging. 

Let us designate 

L(t, Ti, T 2 ) = dF(t, Ti) dF(t, T 2 ) = F(t, T x ) F(t, T 2 ) <7 2 (i, Ti, T 2 ) A . (4.10) 

Notice that L is a stochastic variable, since it depends explicitly on the stochastic prices F. 
We define the price process correlation function as 

A(t,Ti,T 2 ) = i (T(t,T 1 ,T 2 )) ir =a 2 (t,T 1 ,T 2 ) (F(i, Ti) F(i, T 2 )) , (4.11) 

where (•)„ is an averaging over the stochastic prices F. 
As a particular example we consider the following process 

dM t {T) = a e~ a ^dW t {T) , (4.12) 
(dAf t (Ti)dM t (T 2 )) = oge-^-Oe-o^-Og-fllT,-^! (ft< (4 13) 

where Wt is a standard Brownian motion. The price process has the following correlation function 

A(i,Ti,r 2 ) ^^g-aCTx-Og-aCTa-Og-^ITa-Txl (i?( t) Tl ) F(t, T 2 )) . (4.14) 

Note that the case /3 — corresponds to the standard one-factor forward curve model. In this case 
the whole curve is driven by a single Wiener process Wt- 

To find the correlation (F(t, Ti) F(t, T 2 )) we need to integrate the price process (|4.9p . For one- 
factor model we obtain 

(F(t,T 1 )F(t,T 2 )}=Fo(T 1 )F (T 2 )l(t,T 1 ,T 2 ), where (4.15) 



l(t,T 1 ,T 2 ) = cxp 



^0 e -a (ti+t 2 ) A _ e -2on 

2a v ; 



n = Ti - £ ; t 2 = T 2 - i . 



Note also that after a relaxation time « 1/a the normalised correlation function Z becomes sta- 
tionary, i.e. depending only on the time differences T\ and t 2 . 

It can be easily shown that for small volatility ct 2 t <C 1 the function Z(t, Ti, T 2 ) sa 1, and thus for 
one-factor model we get 

A(t, Tx, T 2 ) » a 2 e - Q (^+^) F (Ti) F (T 2 ) . (4.16) 



4.4 Constraint surface 

We apply a variational analysis technique to find the increments of the hedge position Sh(t, T) in 
the rolling intrinsic approximation, and to derive a stochastic differential equation for the storage 
option value evolution based on Eqs. (|4.6I) and (|4.7p . 

When finding the functional derivatives, it is important to distinguish between the two following 
cases. Let C[<j>\{t), 2 (i)] be a functional defined on the functions 4>\{t) and </> 2 (i). If they are 
considered as independent variables, the corresponding functional derivatives 

sc sc 

sMt) an S^t) 
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are analogous to the partial derivatives of an ordinary function of two variables. 
On the other hand, if the variables <f>i and </>2 are dependent, then the variation 

5C = s^T) 6Mt) + s^f) SMt) ; 6Mt) = f{SMt)) 

is analogous to a total derivative of an ordinary function. 

Let us consider a functional C defined on the variables t,C(t), F(t,T). If we consider a time 
evolution of the forward curve F(t,T), then all three increments dt,SC(t) and 6F(t,T) become 
dependent. The relation between these increments follows from the requirement that the terminal 
level must be preserved: 

SQend = ; where 

Qend = Q(t)+ I ' r(t,T)dT; (4.17) 



The equation 5Q en d — for every time t defines a surface in the space of F(t,T) and C(t), which 
restricts the possible simultaneous variations of the trigger price and forward curve. We say that 
the equation SQ en d = defines a constraint surface. If we find a variation 5C which takes into 
account the relation between 5t, SC(t) and SF(t,T), we refer to this variation as a variation on 
the constraint surface. 

Since the time-dependent increment of the forward curve SF is driven by a Wiener-type stochastic 
equation of motion, we will treat this increment in a stochastic sense. In the framework of the 
stochastic calculus the square increment SF 2 can not be neglected if compared with the first order 
variation, since 

SF 2 ~ dt . 

Hence the time-dependent variational derivatives must contain the second order terms alongside 
with the first order terms. 

The details of derivation can be found in the Appendix [21 After some algebra we finally obtain 

SC = 1 (/ r« 8FdT-\j r^ SF 2 dT+±JJ r^ P W L uv - jj r« r« L u 

(4.18) 

SC 2 = ^JJr^r^L av , (4.19) 

where we have used the notation 

r^(t,T) = Ar6(C(t)-F(t,T)); r^(i,T) = ArS'(C(t) - F(t,T)) ; 

K(t)= rW(t,T) dT ; M(t) = r^ 2 \t,T)dT; 



L uv = 8F{u) SF(v). 
In particular with this notation we can write 



r W («) r« (v) 6F(u) SF(v) dudv = / / r{p r^ L uv . (4.20) 



-i 2 

r [1) 8Fdt -- 

Eq. (|4.18|) defines the constraint surface. It relates the increments dt,SC(t) and 8F(t,T). Note 
that according to the forward curve evolution model L uv ~ St. 

The first order variation of the trigger price on the constraint surface is then given by 

dc{t) = W) r r(i) {t ' t) 5f{t) dT ■ (42i) 
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4.5 Stochastic trigger price 



In the deterministic formulation of the storage problem, for each predefined forward curve F (t, T) 
we can find an optimal exercise trajectory q(T) = r(t,T). The value of the target function S on 
the optimal trajectory at time t = is called "intrinsic value" . If the prices are stochastic, there 
exist no optimal exercise trajectory, and the solution of the stochastic problem can only provide 
an optimal prompt exercise r{t,t) at every time t. 

In this section we show that the optimal stochastic exercise is bang-bang, and that for every time 
moment t there exist a stochastic trigger price C st {t). We also show that the intrinsic trigger price 
is a good approximation for the stochastic one, and derive the conditions for this approximation. 

Let F{t,T) be the forward curve observed on the time t. We define a spot price process s(t) as 

s(t) :=F(t,t). (4.22) 

Since the forward price curve evolves with time, the spot price s(t) — F(t,t) generally does not 
follow the original forward curve (F(t,t) ^ F(0,t) almost everywhere). The spot process s(t) is 
uniquely defined by the forward price process, and thus to each possible spot price curve s(t) one 
can assign an amplitude or, in case of discrete price and time space, a probability p. 

Let us consider now a set of all possible spot price curves Si(t) each having corresponding proba- 
bility pi. Considering the spot price curve Si(t) as a deterministic function, we can solve a static 
optimisation problem for the target functional 

St = - [ q(t) Si (t)dt. 
Jo 

Thus, for every spot price curve Si(t) we can find a corresponding trigger price Ci, exercise strategy 
Ti(t) and intrinsic value Si. On each spot price path the trigger price is different, and the exercise 
strategy Ti(t) is optimal only for that path. 

Now we are looking for an optimal prompt exercise decision q(0) for the stochastic problem. Since 
we can only make one exercise decision at a time, this exercise decision will not be optimal for some 
spot price paths. If on the ith spot price path the optimal exercise is ^(O) ^ <z(0), the suboptimal 
exercise decision will lead to a loss of value on that path. We need to make such a choice of q that 
would minimise the losses on each path, for which the decision is not optimal. 

During the time interval dt the volume increment in the storage is given by dq — q dt. Thus, on 
the ith spot price path the volume in the storage is by (<?(0) — n(0)) dt bigger than it had to be, 
if the exercise was optimal for that path. It leads to the change of the value on the ith path by 

SSi = (Ci - F(0,0)) (q(0) - ri(0))tft (4.23) 

Indeed, the change of the value has two reasons - additional expenses SSn of buying an addi- 
tional volume, and increased value 5Si2 due to change in the actual storage volume. The price of 
additional volume is 

5S a = -F(p,0) (q(0) - n(0)) dt. 

The change of the value due to change of initial volume follows from the formula dS ' j 'dq s tart = C 
(see Sec. EH]): 

SS i2 = C i (q(0)-r i (0))dt. 

Combining the latter two equations we obtain Eq. (14.231) . Now we average SS over all paths: 

(SS)=Y / Pi(d-FM)(m-n(0))dt, (4.24) 
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where pi is the probability of the ith path. The optimal exercise decision q(0) should maximise 
the expected change of option value (SS). Deriving (SS) with respect to q(0) we obtain 



d (SS) 



Pi ( C * ~ F (°' °)) dt = ( (°) ~ F (°> °)) dt . ( 4 - 25 ) 



where (C) = p» Cj is the trigger price averaged over all possible spot price processes. If the 
spot price F(0, 0) is bigger than the average trigger price (C), then the derivative is negative, and 
the optimal stochastic exercise is the smallest possible allowed by constraints. Similarly if F(0, 0) 
is smaller than the average trigger price, then the optimal exercise is the biggest possible allowed 
by constraints. Thus, the optimal prompt stochastic exercise is 

We see that the optimal stochastic exercise is bang-bang, and that the expected value of the 
intrinsic trigger price (C) can be interpreted as the stochastic trigger price C s t'. 

C st = (C) . (4.27) 

Next we can show that under some assumptions the stochastic trigger price in the leading order of 
Taylor expansion equals the intrinsic trigger price. Indeed, let us designate so(t) that spot price 
path which coincides with the initial forward curve at time t = 0: 

s (t) = F(0,t) . 

The trigger price Co is then the intrinsic trigger price, calculated at time t — on the basis of the 
forward curve F(0,t). All other paths Si(t) are different, and we designate 

Ssi(t) = Si(t) - s (t) . 

This difference is zero at the beginning of time period: 5si(0) = since all paths of the spot price 
process start in the same point Sj(0) = F(0,0). At the end of the period the difference Ssi(T e ) 
may be arbitrary large, but the majority of paths stay within the range 



If 5si(t) remains small, then the difference in the trigger level 5Ci = C{ — Cq can be calculated by 
means of the first order Taylor expansion (|4.21l) : 

SQ w / ' <5(C - *„(*)) Ssi(t) dt . 

Jo 

Since the forward price process is a martingale, we conclude that (Ssi(t)} = 0, and hence in the 
first order (SCi) — 0. Consequently 

C st = (C) =C Q + (SC) w Co , (4.28) 

which means that the stochastic trigger price equals the intrinsic trigger price. Since any time 
moment t during the exercise period can be considered as a starting point, the following statement 
holds: for any time t the optimal stochastic exercise is bang-bang, and the stochastic trigger price 
C s t(t) is approximately equal to the intrinsic trigger price Co(t), calculated at time t on the basis 
of the deterministic forward curve F(t,T). 

Note that to derive the relation (|4.27D we did not use any approximation, and hence this relation 
is exact. For derivation of relation (|4.28l) we used the only assumption of small variation 



(4.29) 

SQ(T e ) 
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For a 1-factor mean reversion price process the relative spot price variation can be easily estimated: 



i exp 



2 

' I _ r 2aT e 



EL ( 

2a \ 



1 . 



so(T e ) 

In particular, for a T e <C 1 the condition for the intrinsic exercise approximation becomes 



'expfcg T e ) - 1 « 1 , 
and for aT e >l we get the condition 



'exp(||) 

4.6 Variation of the intrinsic target function 

In this section we find a variation of the intrinsic target function on the constraint surface. We 
take into account the dependency between variations of the forward curve and the intrinsic trigger 
price, as in Eq. (|4.18|) . 

Below we omit the argument t and use the notation F(T) = F(t, T). We write the intrinsic target 
function in the form 

S(t) = - J ' r(T) F(T) dT ; where r(T) = r mm + Ar 9(C - F(T)) . (4.30) 
Next we use the expansion 

5S = 5 C S + 8 F S + ^(S FF S + 2 S CF S + 8 CC S) . (4.31) 
The variational derivatives can be found easily: 

5 C S=—5C = -CK5C:, (4.32) 

SfS = J SF(T) dT = J(c r« r) SF dT ; (4.33) 

.- , - - S 2 S 

off j 



SF(u) SF(v) dudv = / (2 r« - f) L uu du ; (4.34) 
SfcS = J 5F 5 * T S )dc SF(T) dT6C = ±JJ (r<» F u - r£>) r v L uv du dv = 

= % I! r^r^ L uv dudv; (4.35) 



SccS = J r< 2 ) F dt SC 2 = - + R C 2 - JJrWr^L uv dudv; (4.36) 

where L uv = 5F(u) SF(v), L uu = 5F 2 {u). 
Combining all variation terms we finally obtain: 

SS = - J r(T) SF(T) dT+^J r« L uu du-^JJ r« r« L uv du dv . (4.37) 

We can easily interpret the first term appearing in the latter expression. The variation 

- / r(T) SF(T) dT 
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reflects the change of the value of the current hedge volume of underlying due to the changed 
forward prices, provided the hedge volume equals the optimal intrinsic exercise r(T). 

Next non- vanishing term is of the second order: 

~ J (T) 8F 2 (T) dT = ^ / 5{C - F{T)) 5F 2 {T) dT . 

Due to the delta-function under the integral this term is reduced to a sum over trigger times 
{Tj : F(Ti) = C}. This term appears due to the change of the perturbed trajectory r(T) + Sr(T). 
It can be shown (see Sec. (|4.7p ) that the perturbed trajectory almost everywhere coincides with the 
unperturbed one r(T) except a few points, where the transition between injection and release takes 
place. These points are the trigger times. Change of the strategy on an infinitely small period of 
time around every trigger time leads to the second order correction of the target function. 

It is worth noticing that the second order correction is always non-negative. This reflects the fact 
that the rolling intrinsic strategy leads to a non- negative drift of the storage value (see below) . 

The term 

YxJJ d 13 r^L uv dudv 
takes into account the cross-correlations between the forward price returns for different maturities. 



4.6.1 Interpretation in financial terms 

In the financial calculus the derivatives of the target function with respect to the different param- 
eters deserve a special attention. The most relevant of them for the portfolio management are 
usually designated by Greek letters, and traditionally called all together "Greeks" . 

One of the most important derivatives is the derivative of the option value V with respect to the 
underlying price F: 

A - — 
~ OF 

It is termed "delta" and can be used for constructing a delta-neutral portfolio - such a portfolio, 
which value does not change under infinitesimal changes of the price. 

The second derivative of the option value with respect to the spot price is termed "gamma" : 

d 2 v 



r = 



Q F 2 



The gamma is a measure of the curvature of the target function with respect to the underlying 
price. It is also responsible for the time value of the option. 

Combining first and second derivatives, the variation of the option value can be represented up to 
the second order as 

dV = AdF+ \vdF 2 . (4.38) 

Comparing the latter equation with Eq. (|4.37p we conclude that the first order term in the expansion 
represents the option "delta" (within the intrinsic strategy approximation): 



A / r[u) 8F(u) du , (4.39) 

and the second order term represents the storage option "gamma" : 

r ~ / r« (u) L uu du-^ jj r« (u) r« («) L uv du dv . (4.40) 
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4.7 Variation of the intrinsic extremal trajectory 



Here we find a variation of the optimal intrinsic trajectory r(T) on the constraint surface. We 
remind that the optimal intrinsic trajectory can be represented as 

r(T)=r min + Ar9(C-F(T)), 

where we have omitted the observation time t. Expanding to the second order in F and C we 
obtain: 

r (T , dr(T) 8r(T) 1 d 2 r(T) 2 d 2 r(T) 1 d 2 r(T) 2 

5r{T) = ^F- SF{T) + ^ dC+ 2 ~dF^ SF (T) + WdC SFdC +2 dC ■ 

(4.41) 

Evaluating the partial derivatives and making use of Eq. (14. 18)) we finally get 

5r(T) = - r V SF + \ r< 2 > L tt ~^6fJ r« SF V + ^JJ r« r« L uv + 

^ ( / r« 5F U -\J r™ L UU + ±JJ r<?) r W Luv _ Jf r (D r W Lra 

(4.42) 

We see that 6r(T) has singularities at the trigger times {Tj : F(Ti) = C}. We need to give an 
interpretation to this fact, since the trajectory correction Sr(T) in this form violates the maximum 
injection/release constraints. We can show that the singular trajectory variation can be interpreted 
as a shift of trigger times Tj by some value ST. 

We can give an interpretation to the obtained singularity of Sr, if we consider it under an integral 
J 5r(T) 4>{T) dt, where <j)(T) is an arbitrary function of time. Approximately the S— function can 
be replaced by a step function with support St and height Ar. The duration ST should be found 
to fit the integral value. 

Let us consider Sr in a form 

Sr(T) = k r (1) (T) = k Ar S(C - F(T)) . 
Let T* be a trigger time. Then the integral around this time gives: 

+ 5r(T) 6(T) dT = I ' Ar 6(T*) . 
T*-e \F(T*)\ 

As stated above, the variation Sr(T) around the trigger time T* can be approximately replaced 
with a step function r/(T) with support ST and height Ar. The width ST can be estimated from 
simple consideration: 

+ Sr(T) 6{T) dT = kAr J^£l- « Ar ST <j>(T*) . 
t*- £ V m ' \F(T*)\ n ^ 



We thus find for the absolute value of time shift: 
k 



ST. 



F(T*) 



(4.43) 



Since the time t* just splits the regions of q = r m i n and q = r max , the previous consideration has 
a meaning, that the singularity of Sr can be interpreted as shifting of the trigger time T* by the 
value ST to the right or left. If k > 0, then the trigger time is always shifted in the direction 
of injection. If k < 0, then the trigger time is always shifted in the direction of release. We can 
represent the time shift, which takes the sign into account: 

£T = -^— . (4.44) 
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An important property of Sr is that the integral J Sr(T) dT must vanish. Indeed, from Eq. (|4. 1 Tf) 
we conclude that 



SQend = J Sr(T) dT = 0. 



Integrating Eq. (|4.42j) and taking into account the definitions K = J r^'(T) dT 
and M = J A 2 ~> (T) dT we find 

J Sr[T) dT = 0. (4.45) 

This proves the consistency of the expansion formula (|4,42[) , 

4.8 Dynamic hedge and time value 

As a "delta" of a vanilla option we understand a derivative of the option value with respect to the 
price of the underlying. This derivative has a dimension of volume and has an obvious meaning: 
this is an amount of the underlying, whose value changes (up to the first order) to the same extent 
under the small variation of the price, as the option value itself. 

The delta can be used for the delta-hedging. If the holder of the option takes a forward (hedge) 
position, which equals the delta in its amount, but has an opposite sign, then the combination 
of both - option and hedge position - has vanishing derivative with respect to the price. This 
means that for a very small variation of the price, i.e. for a very short period of time, the value 
of the portfolio remains constant (up to the first order) . In this case we say that the portfolio is 
delta-neutral. Since the delta itself depends on the price, the position, which was delta-neutral 
at some time point, will not be so after a short period of time. To remain delta-neutral, one has 
to recalculate the delta and update the hedge position continuously. In an ideal situation, if the 
update of the hedge position can be done continuously, and if the market friction can be neglected, 
the value of the portfolio II (which includes the option value, the value of the hedge volume and 
cumulative cash flows) remains constant. 

The similar logic applies to the storage option. We can identify such a volume of underlying, which 
we can use as a hedge position. But unlike the vanilla option the storage option has an array of 
maturity times, which can be thought of as an array of different hedge products, each having 
a different forward price Fk = F(Tk). For every delivery time T^ we can calculate a derivative 
dV(F)/dFk with respect to the forward price on that particular maturity. The delta in this case 
becomes an array: 

which is a discrete analogue of a functional derivative A(T) = SV/SF(T). 

At the beginning of the storage contract the portfolio consists only of the storage option. The initial 
portfolio value (sometimes referred to as "Profit and Loss" - P&L) equals the estimated option 
value. As the storage starts operating, the owner of the storage sells and buys the underlying and 
changes the hedge position. The P&L changes according to the actual cash-flows, option value 
and the value of the hedge position. At the end of the storage contract the terminal P&L consists 
only of the cumulative cash-flow, since no open hedge position is left after the end of the storage 
period. If the hedge position is optimal at any time, then the P&L remains constant, and the 
terminal P&L equals the initial one, i.e. the storage owner earns exactly the amount predicted by 
the stochastic storage model. 

If the storage delta can not be calculated exactly, then the P&L becomes volatile, fluctuating 
around some expected value. Depending on the "quality" of the hedge position, the deviation of 
the terminal P&L from the expected value may become smaller or bigger. 
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Below we show that the expected value of the terminal P&L depends exceptionally on the prompt 
exercise trades (i.e. on the value of the delta at the current time T = t), whereas the width of the 
distribution of the terminal P&L depends on the "quality" of the hedge position. Whatever the 
hedge strategy is used, the correct average time value is obtained only if at every time moment 
the prompt exercise q(t) is optimal. 

In the next section we consider the rolling intrinsic exercise strategy as one of the possible hedge 
strategies. The rolling intrinsic strategy has a number of remarkable properties. One important 
feature of the rolling intrinsic is that it provides an (almost everywhere) optimal exercise decisions, 
and hence guarantees that the expectation value of the P&L equals the true option value. Rolling 
intrinsic also allows to estimate the option time value by calculating the cumulative cash flows 
from the hedge and exercise trades. 

4.8.1 Rolling Intrinsic strategy 

The deterministic consideration of the storage problem - the intrinsic value - gives a good first 
order estimate of the storage value. The intrinsic value can be computed for example by a dynamic 
programming algorithm, and has a remarkable property of the trigger price: the optimal intrinsic 
exercise strategy is bang-bang, each bang triggered by crossing the trigger level by the forward 
price curve. 

The stochastic reality makes the true value of the storage option different, adding to it a time 
value. It appears due to additional profit opportunities on the price fluctuations. As discussed in 
the Sec. 14.51 under some assumptions a good approximation for the "stochastic" trigger price is 
the intrinsic one. It means that an optimal prompt exercise decision at any time moment t can be 
made by means of trigger level C s t(t) for which the intrinsic level Co(t) is a good approximation. 

In Sec. (|4.ip we have introduced the profit function P, which has a meaning of the cumulative 
cash flow from all hedge and exercise trades, and formulated the stochastic optimisation problem 
in terms of probabilistic maximisation of the terminal profit. We repeat the same argumentation 
here in more details in application to the rolling intrinsic strategy. 

Let F{t, T) be the forward price observed in the market at time t. Let also r(t, T) = q(t, T) be the 
intrinsic optimal exercise trajectory calculated on the forward price curve Fit,T). The volume of 
future trading r(0,T) can be interpreted as the intrinsic hedge volume. Prior the exercise period 
the whole volume r(0,T) can be traded forward, providing the owner of the storage option the 
guaranteed profit 

P(0) = - f \(0,T)FiO,T)dT, 
Jo 

which equals the intrinsic value. Although strictly speaking the cash-flow from the forward con- 
tracts takes place at maturity times, we simplify the picture by assigning all the future cash-flows 
to the time moment the forward (hedge) position is taken. Thus, we can say that the intrinsic 
value is locked at the very first time moment. 

During the exercise period the breathing prices cause the change of optimal intrinsic strategy. If 
at time t the price curve is F(t, T), then at time t + dt it becomes different 

F(t + dt, T) = Fit, T) + 8F(t, T) , 

leading to variation of optimal intrinsic strategy 

r(t + dt,T) = r(t,T) + 8r(t,T) . 

The volume Sr(t,T) can be traded continuously (dynamic hedge), leading to continuous change of 
the profit function: 

6P(t) = -J 6r{t,T)(F(t,T) + SF(t,T)^dT (4.47) 
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corresponding to time period dt (again we assign the cash-flows from all future trades to the current 
time moment t) . The described strategy is called "rolling intrinsic" . It assumes that at every time 
moment the option holder takes the hedge position equal to the current optimal intrinsic strategy. 

Since the trajectory r + Sr is optimal for the price curve F + SF, the following inequality must 
hold (we omit the limits of integration and observation time t) 

- J (r(T) + Sr(T)j (f(T) + SF{T)^j dT > - J r(T) (f{T) + SF(T)j dT , (4.48) 

from which follows 

5P(t) > . (4.49) 

Consequently the dynamic hedge in the rolling intrinsic strategy can only increase the value of the 
profit function. In other words, the time value of the storage option is positive (non-negative). 

Interestingly, the inequality (|4.49|) is only valid for rolling intrinsic strategy. 

In the appendix [B] we show how the time value of the storage option can be calculated from the 
cumulative cash flows of a rolling intrinsic hedge and exercise trades. In the following sections we 
apply this technique to estimate the time value of the storage option. 



4.8.2 Meaning of the Dynamic Hedge 

In this section we show that the hedge position is not relevant for the expected value of the storage 
option. The only prompt exercise trade, i.e. the spot trade, is crucial for the expected value of the 
terminal P&L. The hedge position only effects the distribution of the terminal profit. 

Let us consider some delivery time T € [0, T e \. Let also at some observation time t < T the optimal 
hedge position be 

h op t{t, T) . 

We do not specify here how to calculate the optimal hedge position h op t(t,T). We only believe 
that it could be calculated in some way. Suppose that the real hedge position, which is taken at 
time t 

h(t,T) 

is not necessarily equal the optimal one: 
h(t,T) ^ h opt (t,T) 

almost everywhere. On the other hand, we demand that at the beginning the hedge position is 
zero, and as the time approach the maturity, the hedge position has to be equal the optimal one: 

ft(0,T) = 0; h(T,T) = h opt (T,T) for all T. 

The volume of the underlying associated with the hedge position for the delivery period [T, T + 5T] 
is 

h{t,T) 5T. 

Since the initial hedge position is everywhere zero, there is no initial cash-flow, associated with the 
hedge position. The hedge position changes with the time t. The increment of the hedge position 

d 

dh(t,T) = —h(t,T)dt 

leads to the cash-flow 

dP(t, T) = — (F(t, T) + dF(t, T)) dh(t, T) ST , 
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where F(t, T) is the forward price for the delivery on T, observed on t. The total cash-flow per 
delivery period 6T becomes 



T i-T 

F(t,T)dh(t,T)- dF(t,T)dh(t,T) . (4.50) 



P(T) 
5T 

Now we apply integration by parts, which In Ito calculus has to be slightly modified. Indeed, let 
g(t)=y(t)z(t), 

where z(t) and y(t) are correlated stochastic processes. We use the Ito formula to obtain 

dg = — dy + — dz + - dy 2 + dy dz + - 7— dz 2 = z dy + y dz + dy dz . (4.51) 

ay dz 2 ay 1 oyoz 2 oz A 

Integrating from 1 to 2 we obtain the modified "integration by parts" formula: 



2 j-2 j-2 

ydz+ zdy + 
i Ji Ji 



(yz)\l= I ydz+ I zdy+ I dydz; (4.52) 



Applying this formula to Eq. (|4.50p we obtain 

^P(T) = - (F(t, T) h(t, T)) + J q Ht, T) dF(t, T) . (4.53) 

Averaging the obtained expression over the price increments dF, and noticing that 

(h(t, T) dF(t, T)) = h(t, T) (dF(t, T)) = , 
the expected cash-flow becomes 

^ (P(T)) = - (F(T, T) h(T, T)> = - (F(T, T) h opt (T, T)) . (4.54) 

Thus, the expected cash-flow is independent on the hedge strategy hit, T) for t < T, and only 
depends on the prompt exercise trade h(T,T). 

From this an important consequence follows: the rolling intrinsic strategy is a good approximation 
of the exact optimal strategy. Indeed, as has been shown in Sec. 14.51 the intrinsic prompt exercise 
is a very good approximation of the stochastic one, and hence, rolling intrinsic leads to a very good 
expected value of the terminal storage profit. 

On the other hand, the intrinsic hedge profile may be not close to the optimal hedge profile, and 
thus it may lead to rather poor efficiency of hedge in terms of stabilising the profit function. We 
can estimate the "quality" of the intrinsic hedge strategy by calculating the standard deviation of 
the terminal value of the profit function. 

4.9 Time Value 

For the rolling intrinsic strategy the increment of the target function is given by Eq. (|4.47|) . Av- 
eraging the increment SP(t) and its square with respect to the price increment SF we obtain: 

(6P) SF = - J^' (6r(T)) SF F(T) dT - j*' (Sr(T) SF(T)) 5F dT ; (4.55) 



(SP 2 ) SF =JJ (8r(T>) 5r{T")) SF F{T') F(T") dT' dT" , (4.56) 
where (-) dF is the averaging over the stochastic increment dF. 
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Both (SP) SF and ((SP 2 ))sf are proportional to St. Designating 



we can assign to P(t) an effective stochastic process 

dP(t) = fi p (t) dt + a p (t) dW t , (4.58) 

Notice that the average (P)g F as well as \i v and a p are stochastic values, since they depend 
(implicitly or explicitly) on the stochastic price F(T). The drift fj, p and volatility a p averaged over 
the forward price will allow us to find the distribution parameters of the terminal profit P(T e ). 
Indeed, the expectation value and the variance of the time value Vr = P{T e ) — Pq are given by 
(see App. [Bj 

(V T ) = f * pp(t)dt (4.59) 
Jo 

({VS)}= r° 2 p (t)dt (4.60) 
Jo 

where we have designated 

ftp = (l*p) F i °l = {°p) F ■ 
Substituting Sr(T) into Eq. (|4. 55[) and performing integration we obtain 

(6P)sf = \J rW («) L uu Au ^Yk\\ W W L ™ du dv i ( 4 - 61 ) 

where the integration is performed from t to T e . This result could also be obtained from Eq. (|4.37[) . 
since (see App. [B]) 

(SP) SF = (SS) SF . (4.62) 

Next we observe that r-W(T) = Ar S(C - F(T)) as well as r^(T) = Ar 5'{C — F(T)) are singular 
functions, and the integrals of the type / rW(T) f(T) dT and / A 2 ~>(T) f(T) dT are reduced to a 
summation over the set of trigger times {Ti : F{Ti) = C}. Making use of the identities (|D.4I) and 
(|D.5p we obtain: 

_ Ar ^ Ll {t) Ar 2 ^ L tJ (t) 

^> 2St^\F i (t)\ 2X^)51^^)^^ 

where 

Lij(t) = L{t 7 T h Tj) ■ Li(t) = L H {t) ; 

Kit) = ArS^ . 

This value is stochastic for t > 0. Using the results of Add. [Cland using the notation 
1 

Si 

we finally obtain the average of /i p with respect to the forward curve F: 

Ar^ Ai(t) Ar 2 v-^ A i5 -(t) , A v-^ 1 

M P (t)~ — > ^ ; where K (t) = Ar) — . 

W; 2^\F 0i (t)\ 2K (t)^\F 0i (t)\\F 0j (t)\ f£\Fok(t)\ 

(4.65) 



A<i(«) = tt (Lij) ; Aj(f) := Au(t) ; (4.64) 



Using the results from the App. [C]th e latter summation formula can under some assumptions be 
replaced with the integration, which would allow an easier analytic approximation. 
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4.10 Example calculation 

We apply now the expansion formula to a simple toy example of a storage contract with one factor 
price process (with j3 = 0). The correlation function for this price process is given by 

k{t,T l ,T 2 )=ale- a ^- t '>e- a( - T *-V (F(t, T x ) F(t, T 2 )> « a 2 e -»(*i-t) e -a(r a -t) Fq(Ti) Fo(Ta) _ 
As initial price condition we consider a periodic function 

7T AT 7T 



F (T) =F C + AF sin(u; T) ; w 



T P AT ' 



where T e is the exercise period, N is the number of trigger times and AT is the distance between 
the nearest trigger times. Let also 

C = F C 

be the initial trigger level. We obtain: 

i T„ 



Ti 



N 



Fen — F r _ ; 



\F 0i \ = AFu = n 



AF 
AT 



Fi = 0. 



(4.66) 



Next we substitute 



Ay — <Ti<Jj Fa Foj — T c 2 (JiCFj ; 



where a t = a(t, T) = cr e _a(IW) 



2 ^2 . 

c a i > 



A, = T c ^ a. 



ttAT 

into Eq. f|4.65[) to obtain 
ArFj. 



At ^ — r Ar . 



2ttAT 



• c "0 



2ttAT 

Performing integration we get: 



T.-t 



-2oT 



T-f 



T e -t 



,-aT 



dT 



// p (i) 



ArT 2 a 2 r 
2ttAT 



1 _ e -2 a (Te-t) X _ 2e -«(T e -t) + e -2 a (T e -t) 



2a 



a 2 (T e - t) 



(4.67) 



(4.68) 



For the option time value we finally obtain 

/" T « Ar F 2 a 2 T 2 

v T :=(p(T e ))-p = I n P (t)dt= g ; A ^ e $( a r e ) : 



where we have introduced a function 



1 



= _ [ e ~ 2x - 1 + 2 a; - 4 7 - 8T(0,x) +4r(0,2x) +4 In - 



Here 7 sa 0.577 is the Euler's constant and r(a, z) is incomplete gamma function 



(4.69) 



(4.70) 



T(a,z) 



t a ~ x eT 1 dt . 



2S 



4.10.1 Time Value as a function of a 

The function <i>(x) is bell-shaped with x 2 asymptotics on the left wing as x — > +0 and 1/x asymp- 
totics on the right wing as x — ¥ oo (see Fig. [5]). The function reaches it's maximum approximately 
at x* ps 5.04 and reaches there the value 

$(x*) fa 0.12 

Thus, if we consider the time value Pt as a function of the parameter a (the time T e is kept 
constant), then it reaches its maximum at 



and the maximum time value is 



(4.71) 



Ar F 2 n 2 

V f-WKF T '- < 472 > 

The function $(x) in the whole range can approximately be represented in the form 

*/■ x , , , -5 + e- 2x + 2x + 4e- x (l + x) fltn . 

9{x) « fc x j i '- , 4.73 

x z 

where k(x) could roughly be considered as constant. More precisely, k ps 0.5 to better fit the left 
wing of <J> and k ps 0.9 to better fit the right wing. Around maximum the best fit is achieved with 
k ps 0.6. In the whole range we can take k ps 0.9 — 0.4 e -1 / 18 . 

This approximation results from the replacement 

l-2er x + e- 2x 



in Eq. gjBS]). 

4.10.2 Limiting case aT e <C 1 

For small x the function $(x) has an asymptotics 



$(x)«— , x^0. 



Thus, for small a satisfying the inequality a< the asymptotic time value becomes 

^^Sf^ aT ^°- (4J4) 

We conclude that the storage option time value for a — > is proportional to a 2 . In particular, the 
time value vanishes for a = 0. 

4.10.3 Limiting case aT e > 1 

For this asymptotics we obtain 

,, . -0.5 + 2x-4 lnx 2 fA . 

$(x) w ps - . (4.75) 

x z x 

Thus, for big a satisfying the inequality a» ^- the asymptotic time value becomes 

^"^XF 1 -' «2Woo. (4.76) 
47T a 
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Figure 2: Time Value $ as a function of aT e 



4.10.4 Comparison of analytical results with numerical simulation 

The comparison of the analytical formula for the time value with numerical simulation can be seen 
in the Fig. E 

For this graph we have created a toy storage contract with deliberately reduced influence of the 
surface effects. To a large extent the difference can be explained by the time discretisation effect. 
The analytical formula is obtained in the continuous time approximation, whereas in the real life the 
time is discrete (for instance, gas delivery is usually nominated once a day, and hence the smallest 
time step is one day). The discrete time system has less trading opportunities than the continuous 
time system, and hence its exercise is "suboptimal" from the continuous time perspective. For this 
reason the discrete time system has slightly smaller time value than the continuous one. 

Other sources of error are surface effects (boundary influence), higher order corrections of the 
exercise strategy and some approximations used to simplify the analytic expression. 

4.11 Swing option 

In the previous sections we have considered the storage option problem with fixed terminal con- 
dition. This condition has a significant impact on the option value. Indeed, this is simply an 
additional constraint which restricts the possible strategies and hence can only decrease the option 
value. Although in most of the storage contracts there is no restriction on the possible terminal 
level, the problem still can be considered as having the fixed terminal state constraint. The reason 
for that is simple. The derivative of the storage value with respect to the terminal state is given 
by Eq. (O 

dS = c 

dq en d 
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Figure 3: Time Value $ as a function of aT e . Analytic solution is compared with numerical 
simulation. 



where C is the trigger price of the last part of the trajectory. If the prices F(T) are strictly positive, 
the trigger price can only be positive as well, since the forward curve should cross the trigger level 
at some poin10. Thus, it is always profitable to have at the end as little gas in the storage as 
possible within the constraints boundaries. It means that the optimal trajectory has to terminate 
at the lower boundary, and hence the problem can be considered as having the fixed terminal state. 
If we variate the forward curve F — > F + 5F, this will lead to variation of the optimal strategy. 
However, the new strategy preserves the terminal level. 

There are examples of the contracts for which this logic is not applicable. For instance, if the 
remaining gas can be sold at the fixed price F e , then the derivative of the storage value with 
respect to the terminal state is given by Eq. (|3.2ip 

dS R-C-0 



dq en d 

from which the condition (|3. 22!) 

C = F R 



follows. The vanishing derivative implies that the trajectory may terminate anywhere between 
the upper and lower boundaries. The variation of the forward curve would lead to an "arbitrary" 
variation of the optimal trajectory which does not preserve the terminal state. 

Another more common example is a swing option. Usually a swing option is a gas supply contract. 
The option holder is given a right to purchase gas from a producer at some "contract" price. He 
would then sell the gas on the market at the "market" price. Thus, the effective price for the 
option holder is the spread between the "contract" and the "market" price. This spread can be 

4 Strictly speaking, we can construct some extreme example, when the trigger price can be zero. For instance if 
the storage at the beginning is fool and release costs are so high that for some periods of time it is profitable rather 
to do nothing than to release gas. Here we do not consider such examples. 
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positive or negative. A typical swing contract obliges the option holder to take some amount of gas 
within the contract period, however the option holder has some flexibility: the total taken amount 
can be anything within the constraints - minimal and maximal total taken volumes. The option 
holder also has some flexibility during the contract time to take bigger or smaller amount of gas on 
particular days, making maximum of the opportunities to earn on the spread. The swing contract 
can be easily formulated in terms of a storage, and hence possesses all features of a storage option. 
However the matter of fact that the spread between contract and market prices can be negative, 
makes the swing option having slightly different peculiarities. 

The difference between the swing and storage option is rather relative. One can find examples of 
swing options having all the features of a storage option. If the swing option is deep in the money 
(i.e. the market prices are much higher than the contract prices), then the option holder is most 
likely to take the maximal allowed amount of gas, making the optimal trajectory to terminate at 
the lower boundary. Similarly, if the option is deeply out of the money, the optimal trajectory is 
very likely to terminate on the upper boundary. However if the swing option is at the money, it 
is quite probable that the total taken amount of gas will be between maximum and minimum. In 
this case the swing option problem becomes a problem with free terminal state. 

This section is devoted to the problem with the free terminal condition. We will show that this 
problem has a different time value and derive the time value in a similar way it has been done for 
the problem with fixed terminal state. 

The problem with free terminal state has an important feature. As it was shown in the Sec. (|3.7.2[) . 
the derivative of the option value with respect to the terminal volume must vanish. This leads to 
the condition on the trigger price C = F e . Since there is no final unit price in the swing contract 
we conclude that for the problem with free terminal condition 



follows. These conditions will simplify the derivation of the option value evolution equation. In 
particular, the variation of the target function becomes (compare with Eq. (|4.37|1 ) 



The second term gives a non- vanishing part of the average option value increment. If we compare 
this formula with Eq. (I4.37|) we see that SS of the problem with fixed terminal state contains an 
additional negative term. Hence the averaged growth rate of the option value of the problem with 
free terminal state is higher than that of the problem with fixed terminal state, as expected. 

The evolution equation for the option time value becomes 



C = 0, 



from which obviously the condition 



6C = 




(4.77) 



dP(t) 



jJL p (t) dt 



+ 



a p (t)dW t ; 



with 




(4.78) 



Repeating the calculation in the previous sections we finally obtain 




(4.79) 
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4.11.1 Example calculation 



Let the evolution of the spread be described by a Normal Brownian one-factor price process 

dF{t,T) = n(t,T)dW t ; K(t,T) = Koe-^-V ; (4.80) 

Note that a natural choice for the price process describing the evolution of gas forward curves is a 
geometric Brownian motion, whereas for the spreads evolution a plane Brownian motion is a more 
suitable choice. Unlike the volatility a of the geometric Brownian motion, the Normal volatility k 
includes additional dimension of price. 

The correlation function of the price process (|4.80[) becomes 

A«(t) = j t (dF(t, Ti) dF(t, Tj)) = z^e"^"') e -«( T i-*) . (4.81) 
Let the initial forward curve be given by 

F (T) = AF sin(c T) ; u = ^ = , (4.82) 

Repeating the calculation similar to that in the previous section we obtain the swing option time 
value 

At- k- 2 T 2 p -2x i O ~ _ 1 

v ^ = ^x^^ aT ^> where ^ = ~ — ~r^- ( 4 - 83 ) 

This formula resembles the formula of the time value of a storage option (|4.69[) , where the volatility 
k replaces the product F c ctq . The asymptotics of the swing option time value for the big mean 
reversion parameter coincides with that for the storage option: 

Ami T e 

However in the limit of small mean reversion the swing option time value reveals different properties: 
ArnlT 2 / 4 \ aT _> . ( 4 . 85 ) 

For a, = the time value reaches its maximum 

and on the entire range the time value is a decreasing function of a (See Fig. 0]). 

This result can be easily interpreted. The main driver of the storage option time value is the 
change of exercise strategy. Since any change of the exercise strategy must preserve the terminal 
volume level, the storage option becomes a zero sum game: any decision to inject more at some 
time moment must be compensated by the decision to release more at some other time moment. 
Hence the storage option is in some sense similar to a set of time spread options. The only way 
the time spread option can increase its value is an asynchronous movement of different parts of the 
forward curve. In the limit a — > the forward curve moves in a parallel way (in log scale), and 
hence does not lead to any change of the exercise strategy, i.e. to the increase of the time value. 

On the contrary, the swing option is similar to a strip of european call options. Indeed, since there 
is no terminal level preservation condition, any change of the strategy at one (delivery) time has 
no impact on the rest of the trajectory. If for instance we hedge the exercise trajectory at the 
very first observation time moment t — 0, then the change of the exercise decision for any time T 
would make a profit. The pay-off of the exercised volume on the time t will have a structure of the 
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Figure 4: Time Value $ as a function of aT e . Analytic solution and numerical simulation. 



european vanilla option. The effective square volatility of the option with exercise time T is given 
by 

f = f T <7 2 {T,T)dT. 

Jo 

It's easy to see that the biggest effective volatility <r 2 is achieved for the flat volatility term struc- 
ture a = 0, thus leading to the highest possible time value of the swing option. 

This has a strong impact on the way the price process for the forward curve evolution should be 
calibrated. Let us consider a price process in the form 

dF{t.T) = m{t,T) dW t {1) + K 2 (t,T) dW t (2) ; (4.87) 

where dW^ and dw/ 2 ' are (possibly correlated) standard Brownian processes. Let also «i and 
k 2 be exponential functions, describing the short term and long term volatilities 

ki = kio e- Ql ; n 2 = k 20 e~ Q2 ') . 

Here the mean reversion parameter a± is relatively big (compared to 1/T e ), i.e. the term K\ 
describes the short term volatility. The second term n 2 describes the long term volatility, which 
means a 2 <C 1/T e . The correlation function decouples into three terms 

A« = i (dF(Ti) dF(T 3 )) = Ag> + Ag 2) + Ag> , where 

= (F(T l )F(T 3 )) Ki(Tj) Ki(Tj) ; 
Ag^^TOF^)) K 2 (Ti) K 2 (Tj) ; 
Ag 2) = p (F(Ti) F(Tj)) ( Kl (3i) ks^) + «a(T0 ^(T,)) • 
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Since the option time value (both storage and swing) is a linear functional on the correlation 
function, the time value also splits into three components, which can be estimated independently. 

For evaluation of the storage option, the second and third terms will give no contribution, since 
they correspond to the long term volatility (a — > 0). Consequently for the storage option the 
second term in the r.h.s. of the price process can be neglected. 

This is not the case for the swing option. Since both - long term and short term volatilities 
contribute to the option time value, we can not neglect the long term volatility in the price process. 

5 Discussion 

In Sec. [3] we considered a deterministic storage problem and have fount its solution in an implicit 
form. The solution provided by Eq. (13.81) contains a free parameter - trigger price, which can be 
found from boundary conditions. The obtained solution reveals the key features of the intrinsic 
optimal exercise strategy. 

• The optimal exercise is bang-bang. There is a "dead zone" around the trigger price. The 
optimal strategy is to inject at maximum rate if the price is below the zone, release at 
maximum rate if the price is above the dead zone and do nothing if the price is within the 
dead zone. 

• The width of the dead zone is defined by the operating costs. 

• The trigger price is constant for every piece of the trajectory where it does not sticks to 
the boundary. Different pieces of trajectory separated by the boundary touch may have 
different trigger level. If the trajectory touches the boundary in the interval t £ (0,T e ), the 
conditions (|3.1H3.1^|) must be satisfied. 

We also considered an impact of different constraints on the solution. In particular we found that 

• the cycle constraint is equivalent to additional injection/release costs. 

• the carry cost preserves the bang-bang property, but makes the trigger price growing with 
the time at the rate of the carry cost. 

• The volume dependent injection/release costs lead to a time-dependent trigger level, but 
preserve the bang-bang property. 

• Fir the price of the underlying is strictly positive, and there is no final unit price, then 
the storage problem (both intrinsic and stochastic) can be considered with fixed terminal 
condition. If there is a positive final unit price, or if the prices of the underlying can be 
negative, then the optimisation problem becomes a problem with free terminal condition. 
This problem has an additional condition on the trigger level (|3.22p for storage or (|3.23[) for 
swing option. 

In Sec. H] we developed a perturbation analysis to find the solution of the stochastic problem in 
rolling intrinsic approximation. 

First important result obtained in this section finds the relation between the intrinsic and stochastic 
solutions. In particular we show that the stochastic optimal exercise has exactly the same bang- 
bang feature with a trigger level and a dead zone. The stochastic trigger level is given by the 
averaged intrinsic trigger price (Eq. (|4.27|1 ). where the average is taken over all realisations of the 
spot price process. Under the condition (|4.29[) the average trigger price equals approximately the 
current intrinsic trigger price (conditional on the current initial condition and forward curve) . This 
condition defines the range of applicability of the rolling intrinsic approximation. 
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Next we show that the hedge position has no impact on the expected option value. The only 
relevant part of the hedge strategy is the instant (prompt) exercise at the current time. This 
justifies the assumption that the intrinsic exercise profile as a good approximation of the optimal 
stochastic hedge profile, and generally, the rolling intrinsic strategy is a reasonable approximation 
for the optimal stochastic strategy. 

By applying a variational analysis to the intrinsic solution we obtain the option greeks - delta 
and gamma. This allows us to find the option time value. Making use of the option gamma we 
derive a stochastic differential equation governing the evolution of the option value. The drift term 
(Eq. (|4.65[) for storage and Eq. (|4.79D for swing option) can be integrated yielding the option time 
value. 

We apply the obtained time value formula for a toy examples of the storage and swing contracts 
with a simple 1-factor price process. The storage option time value (Eq. (|4.69[1 ) is a bell-shaped 
function of a, approaching zero for a — ¥ and a — > oo, and reaching its maximum at aT e w 5. 
The swing time value appears to be a decreasing function of a for the entire range, reaching the 
maximum for a = and approaching zero for a — > oo. 

The option time value appears to be a linear functional on the prices correlation function. From this 
fact we conclude that if the correlation function can be decoupled into independent components, 
the option time value can also be decoupled. This allows us to investigate effect of different 
components of the price process on the time value independently. 

The difference in the limiting case a — > between swing and storage option time value has an 
important consequence. This limiting case corresponds to the "long term" volatility due to the 
flat shape of the correlation function in this limit. If we consider a price process, whose correlation 
function could be decoupled into short and long terms, the impact of the long term component can 
be investigated independently. We conclude that the long term volatility has a significant impact 
on the swing option time value, but has no impact on the storage option time value. This fact can 
also be used in the price process calibration procedure. 

Last important comment concerns the applicability of the obtained results. Throughout the deriva- 
tion process we used a number of simplifying approximations. In particular, the rolling intrinsic 
approximation is valid for sufficiently small volatility. For the derivation of the option time value 
we also have neglected the surface effects (possible influence of the storage being completely full 
or empty), the bid-offer and operating costs, and used rather simple time- independent option 
constraints. For the derivation of analytic formulas of the time value we also implicitly used an 
assumption about sufficient smoothness of the forward curve, which justified the use of Taylor 
expansion. Another significant approximation was made for the definition of the swing option, 
where we have allowed the option holder to take any possible amount of the underlying limited 
only by the maximum release rate. 

All these approximations restrict the applicability of the obtained results. However all the quali- 
tative results remain valid for much bigger range of parameters. And the main results can still be 
used for understanding the option value driving mechanisms and for building an intuition about 
the influence of different factors on the option value. 

Suppose that we are considering a real storage and we are interested in the applicability of the 
option value formula. For instance, we find that the intrinsic strategy spends a significant time on 
the boundary. This means that we can't neglect the surface effects, and the option value formula 
is not applicable directly. However, as the experiments show, all the major qualitative results still 
remain valid, and the option value formula could be corrected by some multiplier, which is weekly 
dependent on the storage parameters. Thus evaluating the storage for one set of parameters, we 
can easily predict how the value would change for a different set of parameters. 

Another example of the restricted usability of the option value formula is the swing contract. A 
real swing contract usually has two global constraints - maximum and minimum volume which 
can be taken during the contract period. These constraints were not taken into account in the 
swing value formula, which allowed us to use the free terminal condition approximation. However, 
if the option is "at the money" , the probability is very high that the total taken volume will 
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be between the maximum and minimum, and thus satisfy the requirement of the free terminal 
condition. Hence the swing formula remains a good approximation for the at-the-money option. 
The swing options which are either deep in- or out-of-the-money, are very likely to finish on either 
lower or upper boundary of the total taken volume. Such an option satisfies the condition with 
fixed terminal level, and hence a storage formula can be used to estimate the time value of such 
an option. An intermediate case, when the swing option is only slightly in- or out-of-the-money, 
neither storage nor swing formula become directly applicable. However, one can see that these two 
formulas allow a continuous transition, since they only differ by one term, which is not present in 
the swing formula. Thus the ultimate formula could be easily adjusted for the particular swing 
option. 



Appendix 

A Derivation of Constraint Surface equation 

To find the equation of the constraint surface we vary the equation 14.171 Since F(t,T) follows a 
Wiener type stochastic process, so does the C(t). Hence we find the variation of the Q en d up to 
the second order in SF and SC: 

dQ md =(q-r(t)) dt+^f5C + J ^SF(T)dT + SC J ^(T) SF(T)dT+ 

, 1 d 2 Q end S(j2 + 1 re J^Qend 5F{u) 5p{v) dudv = Q {A1) 



2 dC 2 2JJ SF{u)SF(v) 

On the optimal intrinsic trajectory q(t) = r(t,t), and the first term in the r.h.s. vanishes. 

We introduce the notation: 

rW ( t , T) = Ar 5(C(t) - F{t, T)) ; (A.2) 

r ( V (t, T) = Ar 6'(C(t) - F(t, T)) ; (A.3) 

K{t) = J \ {1 \t,T)dT; (A.4) 

M(t) = p ' r P>(t,T)dT, (A.5) 

where S(x) is the Dirac delta- function, and S'(x) - its derivative. Making use of Eq. (|4.1j) and the 
notation above, we find 

dQend K{t} . (A g) 



dC 

^ Qend 

dC 2 

)Q "' d =-r«(t,T); (A. 



M(t) ; (A.7) 



5F(T) 

8 Qend = (2) ( . T) . (Ag) 

dCSF(T) l ' >' 1 ' 



S"Qend ^(2) 



SF(ti)SF(t2 
We thus obtain a quadratic equation on SC 

i-T, 



rW(Mi)%-<a). (A.10) 



5C=j£ (J \W8FdT-±f \^SF 2 dT + SC J ' SF dT - ™ SC 2 ) . (A.ll) 
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Here for simplicity we omitted arguments of the functions: r — r(t, T), 5F = 6F(t, T), K — K(t), 
M = M(t) and SC = SC(t). 

We search the solution in form of a series. As a zeroth order solution we take 

SC = . 
The first order solution becomes 

SCl = in r « 8FdT-±J r< 2 > 5F 2 dT^j 

Substituting SCi into the r.h.s. of Eq. (|A.11I) . and leaving only 1st and 2nd order terms we finally 
obtain 

*C=Jc{J WdT-\j r« 8F 2 'iff + 1 J 'J r« L uv - ^ J J r« r« L uv ) ; 

(A.12) 

SC2 = ^JJ W W L ™ , (A.13) 
where we have used the notation 
L U v = SF(u) SF{v) . 



B Portfolio dynamics and time value 

In this section we show how the option time value can be obtained from the dynamics of the 
portfolio components in the rolling intrinsic strategy. 

We start with summarising some basic facts from the option pricing theory. Let F be a price of 
some underlying with delivery at time T, and let f(t,F) be a price of an option expiring on T. 
Let the price be following a geometric Brownian motion (for simplicity we disregard the interest 
rate), which in Ito representation reads: 

dF 

— =a(t)dW t . 
Using Ito rule we can find a differential of the option price: 



According to the non-arbitrage condition average of this expression must vanish: 

(df)=0. 
Noticing that (dF) = we obtain 

4^ + 10^ = 0. (b.i) 

or substituting dF 2 explicitly 

This equation (up to the interest rate terms) is known as Black-Scholes equation. For us it is 
essential that the operator 

r) 1 f) 2 - 1 . r) « rl 2 

B = ^ + -a 2 F 2 ^ = d+-a 2 F 2 T, where §=■?-: ~ 



dt 2 dF 2 2 ' dt ' dF 2 1 
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applied to any martingale process vanishes: 
Bf = 0. 

Designating the operator of the full differential as 

d= *d + d F±=*± + dt\o>l?lL + 4F± (B.3) 
we find 

df df 

df = dtB f + dF = T^-dF; IB A) 

dF OF K ' 

Next we derive the expression for the differential of the option Delta dSpp. To do that, we notice 
first that the operators d and d/dF do not commute. Using the obvious commutation rule 



d >dF 



d- — d=-dta 2 F 



dF dF dF 2 



we find 

dF dF~ J " " dF 2 ™" dF 2 ™* " * dF 2 
B.l Dynamics of the portfolio components 



d% = ^df-^F^l d t=^l d F-^F^l d t; (B.5) 



Let us consider a standard situation when an owner of an option hedges it with a combination of 
linear products. The full portfolio consists of three components: an option expiring on T, a hedge 
sub-portfolio, consisting of linear products, and a cash account. Let 11(f) be the value of the full 
portfolio at the observation time f . Then we have 

11(f) = /(f) +tf(f) + P(f), 

where /(f) is the option value, H(t) is the value of the hedge portfolio, and P(t) is the balance of 
the cash account. 



The dynamics of the option value / is given by Eq. (IB.4|) 

Next we find the dynamic equation for other portfolio components. Let us consider a hedge 
approach, when the hedge portfolio consists of some amount of underlying h, i.e. hedge portfolio 
contains only linear products. The value of the hedge portfolio equals the current price of the 
hedge volume: 

H = Fh. 

In a classical scheme we may think of the option / as a vanilla option on some kind of shares. In 
this case the hedge consists of some amount of shares which is purchased prior the expiry date. 
The same logic can be applied to a commodity market. For instance / could imply an option to 
purchase some amount of gas. In this case hedge consists of forward contracts with delivery on the 
option expiry date. Although strictly speaking, the cash-flow associated with the delivery on the 
forward contract can take place on a later stage, we assign the cash-flow to the current (observation) 
time f, and hence the value of the hedge portfolio reflects the total price of underlying, and not 
only of the forward contracts. 

The hedge portfolio is a product of two stochastic variables. Using Eq. (|4.51[) from the main text, 
we find 

dH = hdF + (F + dF) dh. (B.6) 
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The latter equation can be formulated as retarded action principle. This principle follows from 
a physical meaning of the dynamic hedge: the change of the hedge volume takes place after the 
change of the price is observed, and the additional hedge volume dh has to be purchased at the 
new price F + dF. Notice however that the retarded action principle does not require any special 
differentiation rule. 

The dynamics of the cash-flow component is easy to find. The cash-flow appears due to the change 
of the hedge volume: 

dP = -{F + dF)dh. (B.7) 

The increment of the portfolio value is given by 

Of 



dn = df + dH + dP=\^p + hjdF. (B.8) 

First we notice that if we use the delta-hedge, i.e. if the hedge volume equals the option delta with 
the opposite sign 

k = -dF> 

then, as expected, the variance of the portfolio equals zero: 

dU = df + dH + dP = . 

Next important observation can be made about the relation between hedge portfolio and cash-flow. 
Averaging the increments dH and dP with respect to the stochastic price increment dF we obtain: 

(dH) dF = - (dP) dF , (B.9) 

where is the averaging over the price increment. 

B.2 Exact delta-hedge 

Let us consider the case when the hedge volume equals the exact (negative) option delta 
df 

h = -f' = --^=; H = —Ff , 
dF' 1 ' 

Using Eq. (|B.5[) we get for the hedge increment: 

dH = -f dF-F df ~ dF df = -(/' + F f") dF . 
Note that on average the hedge portfolio value and the cash account do not change: 

{dH) = ; (dP) = . 

B.3 Intrinsic target function and time value 

Here we show how the time value of a vanilla option can be calculated from the intrinsic target 
function. Let Aj nt be the intrinsic option delta, i.e. the delta calculated at the option maturity 
time 

Ajnt^J - — . 

Similarly we define the intrinsic gamma 
d*f(F,T) 
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The intrinsic target function Si n t is defined such that at every time moment it is equal to the 
intrinsic option value 

S int (F) = f(F,T) . 

The variation of the target function is given by 

5S int = A int 5F+^T int 5F 2 . (B.10) 

Note that the increment 5F is independent, and hence 

{6S int ) = ± (T mt SF 2 ) . 

Integrating SSm over the time and averaging over the realisations of the stochastic price process, 
we get 

T 

(SS mt ) = (S int (F(T))) - S int (F(0)) . (B.ll) 

Next we observe that Si n t(F(0)) is the intrinsic option value at time t = 0, and the expectation 
(Sint(F(T))) is nothing else but the option value at time t — 0, and thus the latter integral equals 
the option time value Vt- 

V t = £ (6S int ) = ±£ (r mt 5F 2 ) . (B.12) 



B.4 Intrinsic delta-hedge of the storage option 

A storage option can be considered as a sequence of simple trades of buying and selling some 
amount of underlying. The logic of the derivation of a vanilla option time value can be applied 
to the storage option, if we consider it as a strip of (correlated) vanilla options with a pay-off 
—q{T) F(T) dT, where q(T) dT is the volume purchased at time T for the delivery period dT at 
the spot price F(T). 

For every delivery time T there is a forward price F(t, T) which is as a function of the observation 
time t. Suppose we have an optimisation model, which calculates an intrinsic exercise profile 
qint(t,T) based on the current forward curve F(t,T). Then qi nt (t,T) is the volume of underlying 
(per delivery period dT) to be traded on time T. The value 

s(t,T) = -q int (t,T)F(t,T)- t<T 

can be interpreted as a target function density for the trade on the delivery time T. The same 
function after expiry is constant 

s(t,T) = -q int (T,T)F(T,T); t>T. 

Now the storage option target function can be expressed simply as an integral of the target function 
density over the delivery time 

S(t) = f T ° s(t, T)dT = — f q int (T, T) F(T, T) dT - f' q int (t, T) F(t, T) dT (B.13) 
Jo Jo Jt 

The first integral in this expression represents the value of the closed trades, whereas the second 
integral represents the future profit calculated with the current forward curve and exercise profile. 
At the last time moment t = T e the second integral vanishes, and the target function equals the 
cumulative cash flow from the closed trades. Note that the cumulative cash flow only depends on 
the exercise volumes qi nt (T,T), and not on the hedge volumes q int (t,T) for t <T. 
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Now we apply the logic of the section IB. 31 in order to calculate the storage option time value. 
Obviously the target function calculated at the time t — yields the intrinsic storage option value 

S(0) = v mt . 

The expectation value of the terminal target function value gives the true option value 

(S(T e )) = V 

(this statement relies on the fact that the intrinsic exercise is an optimal prompt exercise, which 
has been justified in the Sec. I4.5|) . and hence, the time value can be obtained as 

V T = / " (dS) , (B.14) 



where dS is a full differential of the target function. 

Some comment should be made regarding the calculation of the target function differential. First 
we split the target function into two parts: 

S = S c S , 

where 

S c = - f q m t(T, T) F(T, T) dT (B.f 5) 

Jo 

represents the "closed" trades, and 

S a = - J " q int (t,T)F(t,T)dT (B.16) 

represents the open trades, i.e. the remaining "future" option value. The definition of the target 
function in the main text coincides with the second "open" component of the target function defined 
above. However we can show that the open target function can also be used for the derivation of 
the time value. 

The variation of the target function should be done with respect to the forward curve F and 
time t. The target function S is an explicit function of the forward curve and of the time, and the 
dependence on the time is due to the limits of the integrals. It is easy to see that the time variance 
of the closed component equals in absolute value the time variance of the open component but has 
an opposite sign: 

dSc dSo 

at at 

Thus the target function can be considered as an explicit function of the forward curve only. The 
full differential includes only the differentiation with respect to the forward price: 

dS = J §dF(u)du + 1 - J SF ^ S SF{v) dF { u)dF( V )dud V 

The closed part of the target function is constant with respect to the forward curve variation (the 
variation of the forward curve impacts only the future values of the curve F(t, T) for T > t), and 
hence 

dS = dS , (B.I7) 
where we designated S the variation which should be calculated for constant time t. Thus 

V T = / ° (BS ) . (B.I8) 
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It is interesting to note that for the storage option the hedge value coincides with the (negative) 
open target function: 

H(t) = J ' q int (t, T) F(t, T) dT = -S , 
and hence 

(SS ) = - (SH) = (dP) (B.19) 

where P is the cumulative cash flow from the hedge trades. Note that this relation would not be 
valid for a vanilla option, since in general case the target function can not be expressed as qi nt F. 



B.4.1 Cumulative cash flow 

There is another way to see how the time value can be obtained from the cumulative cash flow. By 
definition the cash flow (resulting from the hedge trades) with delivery on T per delivery interval 
dT is given by 

dP T (t,T) = -(F + dF)dh: t<T; where h = q mt . 

dP T (t,T) = 0, for t>T. 

The value Pr has a meaning of the cash flow density over the delivery time. The total cash flow 
increment at time t is related to dPr as 

dP{t) = f ' dP T (t,T)dT, 
Jo 

from which the total cash flow follows as 

P(t)= [ dr [ " dPx (t, T) dT = [ dT [ ' dP T (T,T) dT . (B.20) 
Jo Jt Jo Jo 

Both Pt and P should be considered as functions of time t with T been a parameter. 

Integrating dPr{t,T) from to T and noticing that Pt(0) = 0, we get the cumulative cash flow 
per delivery period [T, T + dT}: 

i-T i-T 



Pt(T)= I dP T = - f (F + dF)dh. 
Jo Jo 



Now we use the integration by parts, which in our case can be represented as 

f T f T f T 

{Fh)\a= / hdF+ / Fdh + / dFdh (B.21) 
Jo Jo Jo 

For the averaged cumulative cash flow we obtain 

(P T (T)) = F(0) h(0) - (F(T) h(T)) = F(0) q int (0) - (F(T) q int (T)) . 
The total average cash flow is given by integrating (Pt) over the delivery time: 

(P(T e )) = J * (P T (T)) dT = j q(0, T) F(0, T)dT-Q q(T, T) F(T, T) dT 
Noticing that 

q(0,T)F(0,T)dT 



!■ 
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equals the option intrinsic value, and 
J q(T,T)F(T,T)dT 



equals the true option value, we conclude that 

V T = (P(T e )) = f ' (dP) . (B.22) 



C Averaging an integral with delta function 

In this section we consider an averaging of the integrals of the type 

/(*)= [ T ° 5(C-F)ct>(T,F)dT, (C.l) 



where F — F(t, T) is the forward curve - a stochastic function of observation and delivery times, 
and 4> is an arbitrary function of T and F. We will be interested in the average of the integral over 
the stochastic prices (I(t)} F . 

Due to the delta-function under the integral we can write 

/(*)= J " ' 5{C-F)ct>{T,C)dT, (C.2) 

Introducing the set of trigger times {Tj : F(t, T,) = C} we can calculate the integral I(t) for every 
observation time: 



I(t) = 7$7 > where & = C); Fi = C; ft- <>l ' ' ' T ' 



; (C3) 

T=Ti 



This integral is a stochastic variable. Indeed, the process F(t,T) is stochastic, and hence, the set 
of trigger times as well as the derivatives F% are stochastic. 

The average over the stochastic function F is given by 

(/(*)> = (S(C - F)) <f>(T, C) dT (C.4) 
Here we have assumed that C is constant. 

The average of the delta function can be easily found using Fourier transformation: 

5(C - F) = -L / ( C - F ) du (C.5) 
2vr J 

Making use of the cumulant expansion and keeping only the two first cumulants (which, strictly 
speaking, is only exact for Gaussian variables) we find the average of the exponential: 

(e l ^ c -^)«exp^(C-^ )-^ 2 a 2 ) , (C.6) 

where we used 

(F)=F ; ((F 2 ))=a 2 . 



Substituting the averaged exponential into Eq. (|C.5|) and performing integration we obtain 

1 ( (C-, 
== exp — 



(c,(C-F)) = — L=expf- (C o f r ) . ,r.7. 



44 



For the averaged integral we get: 

(7(f)) « / ' — ^= cxp f- (<7 ~f° )2 ) 0(T, C) dT . (C.8) 



cr\/27r V 2(7 
Note that both To and a are functions of t and T: 

F = F Q (t,T); a = a{t,T) 

The exponent under the integral reaches its maximum at the trigger times {Tj}, when 7b(Tj) = C. 
Expanding the forward curve around this value 

F (T) ^C + F (Ti){T-Ti) 

we notice that for each trigger time averaged delta function becomes a Gaussian bell (as a function 
of time T) of the width er t , which can be found from an approximate relation 



^o(T)| 



C.l slow approximation 

Suppose that the trigger times are distributed rather uniformly on the time axis. Then we may 
speak about characteristic time distance between the trigger times. Let AT be the average trigger 
time step 



AT ~ (T l+1 - T) . 

We say that function cj)(T, C) is slow if it changes insignificantly within the average trigger time 
step: 

where j> = JL$(T,C) , 

Let us also suppose that the time derivative F\ can be considered as constant plus maybe a small 
random correction: 

\Fi\ = F + Ci, M«T - 

In this case we can replace the summation in Eq. f|C3[) with the integral: 

7(f) = S(C - F) <t>(T, F) dT = ^7 « i r m C) dT (C.9) 
Jt i>t \*i\ |i*o| Jt 

where we have assumed that the derivatives \Fq\ can be considered as constant. 

The same approximation can be used for the averaged integral (|C.8|) . The biggest contribution to 
the integral is made by the intervals of the width ~ 2 er t around the trigger times 

If the volatility of the forward price process is not too big, so that 
o-t ^ AT 
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then the function (f> is slow with respect to the width of the Gaussian bell around the trigger times 
in the integral (|C.8[) 



4> 1 

In this case the function (j) can be roughly considered as constant on every integration interval 
(Ti — &t,Ti + cr t ). We thus obtain 

</(*)) « £ 0(Ti, C) / T ' +fft — J= exp (- {C ;f H dT . 

The volatility a as a slow function of the delivery time and can be considered as constant under 
the integral. Substituting in the first order around the trigger price Fq(T) w C + Fo (T — Ti) we 
can estimate the integral over one trigger time as l/|Fo(Ti)|j an( i the averaged integral becomes 

This means that even for the averaged integral under the specified assumptions the same summation 
formula can be used as for the stochastic integral (IC.3p . 

If we demand that the function (j)(T, C) is a slow function of time not only with respect to the 
characteristic time at but also with respect to the time interval AT 



4>{T t+1 ,C) - 4>{T t ,C) 



< 1 



and if |Fo(Tj)| can be considered as constant, then we can roughly estimate 

< J (*)> w 7157 / mC)dT. (C.ll) 
This approximation is used in the example calculation of the storage option time value. 



D Some formulas used in the main text 



For the arbitrary sufficiently smooth functions f(t) and tp(t) the following identities are easy to 
prove: 



[°° 6(<p(t))f(t)dt = Y,r 

J-oo If 



S'(ip(t))f(t)dt = - 

/OO 1 



<P(U)\ ' 

%(*)) i 



fit) 



<p(t) 

d 1 d 



dt 



^ 1 d 



fit) 



tp(t) 



t=u 



\ip(U)\ dt tp(ti) dt 

where the sum is over all points {ti : <p(ti) = 0}. 
In particular for the case (p(t) = C — F(t) we have 



<p(t) 



f°° 8{C-F{t))f{t)dt = Y,i 

J-oo \r 



fiti) 



/°° 1 d 

jic-F^mdt^^- 



M 

F(t) 



(D.l) 
(D.2) 
(D.3) 



(D.4) 
(D.5) 
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Making use the definitions from the main text (we omit the index t where it is unambiguous) 

r(t, T) = Ar S(C(t) - F(t, T)) = Ar 6(C - F(T)) ; (D.6) 

r (1) (T) = ArS'(C-F(T)); (D.7) 

r (2) ( T ) = Ar _ F(T)) ■ (D.8) 

it is easy to prove the following identities: 

y («) F(u) du = CK; (D.ll) 

/" r< 2 > («) F(u) du = K + C M ; (D.12) 

y r< 2 > (u) F(u) /(«) - y rW (u) /(u) du + C J r< 2 > (u) f{u) du ; (D.13) 

y r< 2 > («) F(u) L(«, u) = y (u) L(«, u) du + C y r< 2 > (u) L(u, u) du ; (D.14) 
y ^ r (2) (u) r (1) (v) L(u, v) dudv = jj r (1) (u) r (1) (u) L(u, v) du dv+ 

+ C J J r (2) («) r (1) (u) L(u, w) , (D.15) 
where / (u) is an arbitrary sufficiently smooth function. 
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